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Preface 

This book is about generating functions and some of their uses in 
discrete mathematics. The subject is so vast that I have not attempted to 
give a comprehensive discussion. Instead I have tried only to communicate 
some of the main ideas. 

Generating functions are a bridge between discrete mathematics, on 
the one hand, and continuous analysis (particularly complex variable the- 
ory) on the other. It is possible to study them solely as tools for solving 
discrete problems. As such there is much that is powerful and magical in 
the way generating functions give unified methods for handling such prob- 
lems. The reader who wished to omit the analytical parts of the subject 
would skip chapter 5 and portions of the earlier material. 

To omit those parts of the subject, however, is like listening to a stereo 
broadcast of, say, Beethoven’s Ninth Symphony, using only the left audio 
channel. 

The full beauty of the subject of generating functions emerges only 
from tuning in on both channels: the discrete and the continuous. See 
how they make the solution of difference equations into child’s play. Then 
see how the theory of functions of a complex variable gives, virtually by 
inspection, the approximate size of the solution. The interplay between the 
two channels is vitally important for the appreciation of the music. 

In recent years there has been a vigorous trend in the direction of 
finding bijective proofs of combinatorial theorems. That is, if we want to 
prove that two sets have the same cardinality then we should be able to 
do it by exhibiting an explicit bijection between the sets. In many cases 
the fact that the two sets have the same cardinality was discovered in the 
first place by generating function arguments. Also, even though bijective 
arguments may be known, the generating function proofs may be shorter 
or more elegant. 

The bijective proofs give one a certain satisfying feeling that one ‘re- 
ally’ understands why the theorem is true. The generating function argu- 
ments often give satisfying feelings of naturalness, and ‘oh, I could have 
thought of that,’ as well as usually offering the best route to finding exact 
or approximate formulas for the numbers in question. 

This book was tested in a senior course in discrete mathematics at the 
University of Pennsylvania. My thanks go to the students in that course for 
helping me at least partially to debug the manuscript, and to a number of 
my colleagues who have made many helpful suggestions. Any reader who 
is kind enough to send me a correction will receive a then-current complete 
errata sheet and many thanks. 

Herbert S. Wilf 
Philadelphia, PA 
September 1, 1989 



Preface to the Second Edition 


This edition contains several new areas of application, in chapter 4, 
many new problems and solutions, a number of improvements in the pre- 
sentation, and corrections. It also contains an Appendix that describes 
some of the features of computer algebra programs that are of particular 
importance in the study of generating functions. 

I am indebted to many people for helping to make this a better book. 
Bruce Sagan, in particular, made many helpful suggestions as a result of 
a test run in his classroom. Many readers took up my offer (which is now 
repeated) to supply a current errata sheet and my thanks in return for any 
errors discovered. 


Herbert S. Wilf 
Philadelphia, PA 
May 21, 1992 
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Chapter 1 

Introductory ideas and examples 

A generating function is a clothesline on which we hang up a sequence 
of numbers for display. 

What that means is this: suppose we have a problem whose answer is 

a sequence of numbers, ao, aq, 02 , We want to ‘know’ what the sequence 

is. What kind of an answer might we expect? 

A simple formula for a n would be the best that could be hoped for. If 
we find that a n = n 2 + 3 for each n = 0, 1, 2, . . ., then there’s no doubt that 
we have ‘answered’ the question. 

But what if there isn’t any simple formula for the members of the 
unknown sequence? After all, some sequences are complicated. To take 
just one hair-raising example, suppose the unknown sequence is 2, 3, 5, 7, 
11, 13, 17, 19, . . ., where a n is the nth prime number. Well then, it would 
be just plain unreasonable to expect any kind of a simple formula. 

Generating functions add another string to your bow. Although giv- 
ing a simple formula for the members of the sequence may be out of the 
question, we might be able to give a simple formula for the sum of a power 
series, whose coefficients are the sequence that we’re looking for. 

For instance, suppose we want the Fibonacci numbers F 0 , Fj, F 2 , . . ., 
and what we know about them is that they satisfy the recurrence relation 

Fn+i = F n + F n _ 1 (n > 1; F 0 = 0; Fj = 1). 

The sequence begins with 0, 1, 1, 2, 3, 5, 8, 13, 21, 34, 55, ... There are 
exact, not- very-complicated formulas for F n . as we will see later, in example 
2 of this chapter. But, just to get across the idea of a generating function, 
here is how a generat-ingfunctionologist might answer the question: the 
nth Fibonacci number, F n , is the coefficient of x n in the expansion of the 
function x/(l — x — x 2 ) as a power series about the origin. 

You may concede that this is a kind of answer, but it leaves a certain 
unsatisfied feeling. It isn’t really an answer, you might say, because we 
don’t have that explicit formula. Is it a good answer? 

In this book we hope to convince you that answers like this one are 
often spectacularly good, in that they are themselves elegant, they allow 
you to do almost anything you’d like to do with your sequence, and gener- 
ating functions can be simple and easy to handle even in cases where exact 
formulas might be stupendously complicated. 

Here are some of the things that you’ll often be able to do with gener- 
ating function answers: 

(a) Find an exact formula for the members of your sequence. 

Not always. Not always in a pleasant way, if your sequence is 
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complicated. But at least you’ll have a good shot at finding such 
a formula. 

(b) Find a recurrence formula. Most often generating functions 
arise from recurrence formulas. Sometimes, however, from the 
generating function you will find a new recurrence formula, not 
the one you started with, that gives new insights into the nature 
of your sequence. 

(c) Find averages and other statistical properties of your se- 
quence. Generating functions can give stunningly quick deriva- 
tions of various probabilistic aspects of the problem that is repre- 
sented by your unknown sequence. 

(d) Find asymptotic formulas for your sequence. Some of the 
deepest and most powerful applications of the theory lie here. 
Typically, one is dealing with a very difficult sequence, and instead 
of looking for an exact formula, which might be out of the question, 
we look for an approximate formula. While we would not expect, 
for example, to find an exact formula for the nth prime number, 
it is a beautiful fact (the ‘Prime Number Theorem’) that the nth 
prime is approximately n log n when n is large, in a certain precise 
sense. In chapter 5 we will discuss asymptotic problems. 

(e) Prove unimodality, convexity, etc. A sequence is called uni- 
modal if it increases steadily at first, and then decreases steadily. 
Many combinatorial sequences are unimodal, and a variety of 
methods are available for proving such theorems. Generating func- 
tions can help. There are methods by which the analytic proper- 
ties of the generating function can be translated into conclusions 
about the rises and falls of the sequence of coefficients. When 
the method of generating functions works, it is often the simplest 
method known. 

(f) Prove identities. Many, many identities are known, in combina- 
torics and elsewhere in mathematics. The identities that we refer 
to are those in which a certain formula is asserted to be equal 
to another formula for stated values of the free variable(s). For 
example, it is well known that 



One way to prove such identities is to consider the generating 
function whose coefficients are the sequence shown on the left side 
of the claimed identity, and to consider the generating function 
formed from the sequence on the right side of the claimed identity, 
and to show that these are the same function. This may sound 
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obvious, but it is quite remarkable how much simpler and more 
transparent many of the derivations become when seen from the 
point of view of the black belt generatingfunctionologist. The 
‘Snake Oil’ method that we present in section 4.3, below, explores 
some of these vistas. The method of rational functions, in section 
4.4, is new, and does more and harder problems of this kind. 

(g) Other. Is there something else you would like to know about- 
your sequence? A generating function may offer hope. One ex- 
ample might be the discovery of congruence relations. Another 
possibility is that your generating function may bear a striking 
resemblance to some other known generating function, and that 
may lead you to the discovery that your problem is closely related 
to another one, which you never suspected before. It is noteworthy 
that in this way you may find out that the answer to your prob- 
lem is simply related to the answer to another problem, without- 
knowing formulas for the answers to either one of the problems! 

In the rest of this chapter we are going to give a number of examples 
of problems that can be profitably thought about from the point of view 
of generating functions. We hope that after studying these examples the 
reader will be at least partly convinced of the power of the method, as well 
as of the beauty of the unified approach. 

1.1 An easy two term recurrence 

A certain sequence of numbers ao, aq, . . . satisfies the conditions 

«n+i = 2o n + 1 (n>0;a o = 0). (1-1.1) 


Find the sequence. 

First try computing a few members of the sequence to see what they 
look like. It begins with 0, 1, 3, 7, 15, 31, . . . These numbers look sus- 
piciously like 1 less than the powers of 2. So we could conjecture that 
a n = 2 n — 1 (n > 0), and prove it quickly, by induction based on the 
recurrence (1.1.1). 

But this is a book about generating functions, so let’s forget all of 
that, pretend we didn’t spot the formula, and use the generating function 
method. Hence, instead of finding the sequence {a n }, let’s find the gener- 
ating function A(x) = Yhn>o a nX n ■ Once we know what that function is, 
we will be able to read off the explicit formula for the a n ’s by expanding 
A(x) in a series. 

To find A(x), multiply both sides of the recurrence relation (1.1.1) by 
x n and sum over the values of n for which the recurrence is valid, namely, 
over n > 0. Then try to relate these sums to the unknown generating 
function A(x). 
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If we do this first to the left side of j(l. l.l) , there results ^ ~2 n>0 a n +ix n . 
How can we relate this to A(x)7 It is almost the same as A(xJ. But the 
subscript of the ‘o’ in each term is 1 unit larger than the power of x. But, 
clearly, 


y] a n +ix n = ai + a 2 X + a 3 x 2 + a^x 3, + • • • 

n> 0 

= {(a 0 + aqx + a 2 x 2 + a 3 x 3 H ) - a 0 }/x 

= A(x)/x 


since ao = 0 in this problem. Hence the result of so operating on the left 
side of |( 1.1.1) |is A(x)/x. 

mi 


Next do the right side of 
n > 0. The result is 


Multiply it by x n and sum over all 


o n + l)x n = 2 A{x) + x n 

n > 0 n> 0 

= 2A(x) + -2_, 

1 — X 


wherein we have 
1/(1 — x), which 
If we equate 


used the familiar geometric series evaluation ^ 
is valid for lx I < 1. 


n> 0 


X 


the results of operating on the two sides of (1.1.1) , we find 


that 


A(x) 

x 


2 A{x) + 

1 — x 


which is trivial to solve for the unknown generating function A(x). in the 
form 

= (1 — x)(l — 2x) ' 

This is the generating function for the problem. The unknown numbers 
a n are arranged neatly on this clothesline: a n is the coefficient of x n in the 
series expansion of the above A[x). 

Suppose we want to find an explicit formula for the o n ’s. Then we 
would have to expand A(x) in a series. That isn’t hard in this example, 
since the partial fraction expansion is 


x f 2 1 ) 

(1 — x)(l — 2x) X \l — 2x 1 — xj 

= {2x + 2 2 x 2 + 2 3 x 3 + 2 4 x 4 + • • •} 

— {x + x 2 + x 3 + x 4 + ■ • 

= (2 - l)x + (2 2 - l)x 2 + (2 3 - l)x 3 + (2 4 - l)x 4 + • ■ ■ 


It is now clear that the coefficient of x n , i.e. a n , is equal to 2 n — 1, for each 
n > 0. 
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In this example, the heavy machinery wasn’t needed because we knew 
the answer almost immediately, by inspection. The impressive thing about 
generatingfunctionology is that even though the problems can get a lot 
harder than this one, the method stays very much the same as it was here, 
so the same heavy machinery may produce answers in cases where answers 
are not a bit obvious. 


1.2 A slightly harder two term recurrence 

A certain sequence of numbers ao, ai, . . . satisfies the conditions 

a n - |_i = 2 a n + n (n > 0; ao = 1). (1.2.1) 

Find the sequence. 

As before, we might calculate the first several members of the sequence, 
to get 1, 2, 5, 12, 27, 58, 121, ... A general formula does not seem to be 
immediately in evidence in this case, so we use the method of generating 
functions. That means that instead of looking for the sequence ao, oj, . . ., 
we will look for the function A(x) = Ylj>o a j x ^ ■ Once we have found the 
function, the sequence will be identifiable as the sequence of power series 
coefficients of the function.* 

As in example~T| , the first step is to make sure that the recurrence 
relation that we are trying to solve comes equipped with a clear indication 
of the range of values of the subscript for which it is valid. In this case, the 
recurrence (1.2.1) is clearly labeled in the parenthetical comment as being 
valid for n — 0, 1, 2, . . . Don’t settle for a recurrence that has an unqualified 
free variable. 

The next step is to define the generating function that you will look for. 
In this case, since we are looking for a sequence ao, ai, 02 , . . . one natural 
choice would be the function A(x) — 0 a ;/ x J that we mentioned above. 

Next, take the recurrence relation (1.2.1), multiply both sides of it by 
x n , and sum over all the values of n for which the relation is valid, which, 
in this case, means sum from n = 0 to 00 . Try to express the result of doing 
that in terms of the function A(x) that you have just defined. 

If we do that to the left side of (1.2.1), the result is 

ai + a 2 X + a^x 2 + a^x 2, + • ■ ■ = (A(x) — ao)/x 

= (A(x) — l)/x. 

So much for the left side. What happens if we multiply the right side 
of (1.2.1) by x n and sum over nonnegative integers n? Evidently the result 
is 2 A{x) + E „> 0 nx n . We need to identify the series 


E 

n> 0 


nx 


x + 2ar + 3ar + 4x 4 H 


* If you are feeling rusty in the power series department, see chapter 2, 
which contains a review of that subject. 
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There are two ways to proceed: (a) look it up (b) work it out. To work it 
out we use the following stunt, which seems artificial if you haven’t seen it 
before, but after using it 4993 times it will seem quite routine: 


Y j nx n = Y J 


n> 0 


n> 0 


, d . r 
x(—)x 
ax 


d 




X 


1>0 


x(—) — 
ax 1 — x 


x 


(1-x) 2 ' 

( 1 - 2 - 2 ) 

In other words, the series that we are interested in is essentially the 
derivative of the geometric series, so its sum is essentially the derivative 
of the sum of the geometric series. This raises some nettlesome questions, 
which we will mention here and deal with later. For what values of x is 
(1.2.2) valid? The geometric series converges only for |x| < 1, so the ana- 
lytic manipulation of functions in (1.2.2) is legal only for those x. However, 
often the analytic nature of the generating function doesn’t interest us; we 
love it only for its role as a clothesline on which our sequence is hanging 
out to dry. In such cases we can think of a generating function as only a 
formal power series, i.e., as an algebraic object rather than as an analytic 
one. Then (1.2.2) would be valid as an identity in the ring of formal power 
series, which we will discuss later, and the variable x wouldn’t need to be 
qualified at all. 


Anyway, the result of multiplying the right hand side of (1.2.1) by x n 
and summing over n > 0 is 2 A(x) + x/(l — x) 2 , and if we equate this with 
our earlier result from the left side of (1.2.1), we find that 


( A (X) - 1 ) = 2A(x) + X 


X 


(i — xy 


(1.2.3) 


and we’re ready for the easy part, which is to solve (1.2.3) for the unknown 
A(x ), getting 

. . . 1 — 2x + 2x 2 . 

A(x) = ( i-x)°(i-2*r ( ’ 

Exactly what have we learned? The original problem was to ‘find’ the 
numbers {a n } that are determined by the recurrence (1.2.1). We have, in 
a certain sense, ‘found’ them: the number a n is the coefficient of x n in the 
power series expansion of the function (1.2.4). 

This is the end of the ‘hnd-the-generating-function’ part of the method. 
We have it. What we do with it depends on exactly why we wanted to know 
the solution of 


1 . 2.1 


in the first place. 

Suppose, for example, that we want an exact, simple formula for the 
members a n of the unknown sequence. Then the method of partial fractions 
will work here, just as it did in the first example, but its application is now 
a little bit trickier. Let’s try it and see. 

The first step is to expand the right side of (1.2.4) in partial fractions. 
Such a fraction is guaranteed to be expandable in partial fractions in the 
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form 

1 — 2x + 2x 2 _ A B C , , 

(1 — x) 2 (l — 2x) ~ (1 - xf + + l^k? ( ' ' 5j 

and the only problem is how to hnd the constants A , B , C . 

Here’s the quick way. First multiply both sides of (1.2.5) by (1 — x) 2 , 
and then let x = 1. The instant result is that A = — 1 (don’t take my word 
for it, try it for yourself!). Next multiply (1.2.5) through by 1 — 2x and 
let x = 1/2. The instant result is that (7 = 2. The hard one to hnd is £>, 
so let’s do that one by cheating. Since we know that (1.2.5) is an identity, 
i.e., is true for all values of x. let’s choose an easy value of x. say x = 0, 
and substitute that value of x into (1.2.5). Since we now know A and C, 
we hnd at once that B = 0. 

We return now to (1.2.5) and insert the values of A. B. C that we just 
found. The result is the relation 


1 — 2x + 2x 2 _ (-1) 2 

(1 - x) 2 (l - 2x) ~ (1 - xf + 1 -2z' 


( 1 . 2 . 6 ) 


What we are trying to do is to hnd an explicit formula for the coefficient 
of x n in the left side of (1.2.6). We are trading that in for two easier 
problems, namely finding the coefficient of x" in each of the summands on 
the right side of (1.2.6). Why are they easier? The term 2/(1 — 2x), for 
instance, expands as a geometric series. The coefficient of x n there is just 
2 ■ 2 n = 2 n+1 . The series ( — 1) /( 1 — x) 2 was handled in (1.2.2) above, and 
its coefficient of x n is -(n f 1). If we combine these results we see that our 
unknown sequence is 


a n = 2 n+l -n~l (n = 0,1,2,...). 


Having done all of that work, it’s ti me to con fess that there are better 
ways to deal with recurrences of the type (1.2.1). without using generating 
functions.* However, the problem remains a good example of how gener- 
ating functions can be used, and it underlines the fact that a single unified 
method can replace a lot of individual special techniques in problems about 
sequences. Anyway, it won’t be long before we’re into some problems that 
essentially cannot be handled without generating functions. f§ 


It’s time to introduce some notation that will save a lot of words in 
the sequel. 

Definition. Let fix) be a series in powers of x. Then by the symbol 
[x n ]/(x) we will mean the coefficient of x n in the series f(x). 

Here are some examples of the use of this notation. 


[x n ]e x = 1/n!; [t r ]{l/(l - 3f)} = 3"; [«”>](! + u)’ = 7 V 


See, for instance, chapter 1 of my book 


[Wi2] 
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A perfectly obvious property of this symbol, that we will use repeatedly, is 

[x n ]{x a f(x)} = [x n ~ a ]f(x). (1.2.7) 

Another property of this symbol is the convention that if f3 is any real 
number, then 

[(5x n ]f(x) = (1 /[3)[x n ]f{x) : (1.2.8) 

so, for instance, [x n /n\]e x = 1 for all n > 0. 

Before we move on to the next example, here is a summary of the 
method of generating functions as we have used it so far. 

THE METHOD 

Given: a recurrence formula that is to be solved by the method of 
generating functions. 

1. Make sure that the set of values of the free variable (say n ) for 
which the given recurrence relation is true, is clearly delineated. 

2. Give a name to the generating function that you will look for, and 
write out that function in terms of the unknown sequence (e.g., 
call it A(x), and define it to be J2 n>0 a n x n ). 

3. Multiply both sides of the recurrence by x n , and sum over all 
values of n for which the recurrence holds. 

4. Express both sides of the resulting equation explicitly in terms of 
your generating function A(x). 

5. Solve the resulting equation for the unknown generating function 
A(x). 

6. If you want an exact formula for the sequence that is defined by 
the given recurrence relation, then attempt to get such a formula 
by expanding A(x) into a power series by any method you can 
think of. In particular, if A(x) is a rational function (quotient 
of two polynomials), then success will result from expanding in 
partial fractions and then handling each of the resulting terms 
separately. 


1.3 A three term recurrence 

Now let’s do the Fibonacci recurrence 

F n - i-i = F n + F n _ i. (n > 1; F 0 = 0; F\ = 1). 
Following ‘The Method,’ we will solve for the generating function 

F(x) = Y J F n x n . 

n> 0 


(1.3.1) 
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To do that, multiply (1.3.1) by x n . and sum over n > 1 . We find on the 
left side 


F 2 x + F 3 x 2 + F 4 x 3 = 

and on the right side we find 


F(x) — x 


x 


{F 1 x + F 2 x 2 + F 3 x 3 -I } + {F 0 x + Fix 2 + F 2 x 3 -\ } = {F(x)} + {a:F(x)}. 


(Important: Try to do the above yourself, without peeking, and see if you 
get the same answer.) It follows that {F — x)/x — F + xF , and therefore 
that the unknown generating function is now known, and it is 


F{x) 


x 

1 — x — X 2 


Now we will find some formulas for the Fibonacci numbers by expand- 
ing x/{\ — x — x 2 ) in partial fractions. The success of the partial fraction 
method is greatly enhanced by having only linear (first degree) factors in 
the denominator, whereas what we now have is a quadratic factor. So let’s 
factor it further. We find that 


1 — x — x 2 = (1 — xr + )(l — xr _) (r± = (1 ± \/5)/2) 


and so 


x 

l — x — x 2 


x 

(1 — xr + )(l — xr-) 


1 


1 


1 


(r_|_ — r_ ) \ 1 — xr . |_ 1 — xr- 



t.hanks to the magic of the geometric series. It is easy to pick out the 
coefficient of x n and find 


F n = - r n _) 


(n = 0,1,2,...) 


(1.3.3) 


as an explicit formula for the Fibonacci numbers F n . 

This example offers us a chance to edge a little further into what gen- 
erating functions can tell us about sequences, in that we can get not only 
the exact answer, but also an approximate answer, valid when n is large. 
Indeed, when n is large, since r + > 1 and |r_| < 1, the second term in 
(1.3.3) will be minuscule compared to the first, so an extremely good ap- 
proximation to F n will be 


F n 


J_fl + V5 




2 


(1.3.4) 
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But-, you may ask, why would anyone want an approximate formula 
when an exact one is available? One answer, of course, is that sometimes 
exact answers are fearfully complicated, and approximate ones are more 
revealing. Even in this case, where the exact answer isn’t very complex, we 
can still learn something from the approximation. The reader should t ake 
a few moments to verify that, by neglecting the second term in (1.3.3)[ we 
neglect a quantity that is never as large as 0.5 in mag nitude, and conse- 

, it is exactly equal 
consideration of an 


quently not only is F n approximately given by (1.3.4) 


to the integer nearest to the right side of (1.3.4) Thus 


approximate formula has found us a simpler exact formula! 


1.4 A three term boundary value problem 

This example will differ from the previous ones in that the recurrence 
relation involved does not permit the direct calculation of the members 
of the sequence, although it does determine the sequence uniquely. The 
situation is similar to the following: suppose we imagine the Fibonacci re- 
currence, together with the additional data Fq = 1 and T 735 = 1. Well then, 
the sequence {F n } would be uniquely determined, but you wouldn’t be able 
to compute it directly by recurrence because you would not be in possession 
of the two consecutive values that are needed to get the recurrence started. 

We will consider a slightly more general situation. It consists of the 
recurrence 

au n+ i + bu n + cu n -i = d n (n = 1, 2, . . . , N - 1; u 0 = u N = 0) (1.4.1) 

where the positive integer A, the constants a, 6 , c and the sequence {d n }^T 1 1 
are given in advance. The equations (1.4.1) determine the sequence 
uniquely, as we will see, and the method of generating functions gives us a 
powerful way to attack such boundary value problems as this, which arise 
in numerous applications, such as the theory of interpolation by spline func- 
tions. 

To begin with, we will define two generating functions. One of them is 
our unknown U (x) = XljLo u j x ^ 1 and the second one is D(x) — Yl!j=i djxF, 
and it is regarded as a known function (did we omit any given values of the 
dj’s, like do? or cbv? Why?). 

Next, following the usual recipe, we multiply the recurrence (1.4.1) by 
x n and sum over the values of n for which the recurrence is true, which in 
this case means that we sum from n = 1 to N — 1. This yields 

N- 1 N-l N-l N-l 

a ^2 u n+lX n -I- b *22 u nX n + c ^ 2 u n-lX n = ^ d n X U . 
n = 1 n= 1 n= 1 n= 1 

If we now express this equation in terms of our previously defined generating 
functions, it takes the form 

— {U(x) — u\x} + bU{x ) -|- cx{U(x) — = D{x). (1-4.2) 

oc 
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Next, with only a nagging doubt because m and un-i ar e unkno wn, we 
press on with the recipe, whose next step asks us to solve (1.4.2) for the 
unknown generating function U(x). Now that isn’t too hard, and we find 
at once that 


{a + bx + cx 2 }U (x) = x{D(x ) + am + cun-\x n }. (1.4.3) 

The unknown generating function U(x) is now known except for the 
two still-unknown constants m and un - i • but (1.4.3) suggests a way to find 
them, too. There are two values of x, call them r + and r_, at which the 
quadratic polynomial on the left side of (1.4.3) vanishes. Let us suppose 
that 7 ^ rW for the moment. If we let x = r + in (1.4.3), we obtain 
one equation in the two unknowns m, un - i , and if we let x = r_, we get 
another. The two equations are 

am + (cr^ )u N - 1 = -D(r+) 
au i -I- ( cr N )uN-i = —D{rJ). 

Once these have been solved for m and un- i, equation (1.4.3) then gives 
U (x) quite explicitly and completely. We leave the exceptional case where 

r N _ r N ^- 0 rea( ] er 

Here is an application* of these results to the theory of spline interpo- 
lation. 

Suppose we are given a table of values yo,yi, . . . , y n of some function 
y(x), at a set of equally spaced points U = to + ih (0 < i < n) . We want to 
construct a smooth function S(x) that fits the data, subject to the following 
conditions: 

(i) Within each interval (f^, t/+i) (i = 0, . . . , n — 1) our function S(x) is 
to be a cubic polynomial (a different one in each interval!); 

(ii) The functions S(x), S'(x ) and S"(x) are to be continuous on the 
whole interval [to,t n ]; 

(hi) S(U) = yi for i = 0, . .. ,n. 

A function S(x) that satisfies these conditions is called a cubic spline. 
Suppose our unknown spline S(x) is given by So(x), if x G Si(x), if 

x G [t i , ^ 2 ] 5 - • •, S n -i(x), if x G [t n _i,t n ], and we want now to determine all 
of the cubic polynomials Sq, ■ ■ ■ , SVi-i- To do this we have 2 n interpolatory 
conditions 


Si — 1 (^i) yi Si (ti) {i 1 , . . . , 71 l), So (to) Vo i S n —\ ( t n ) y n 

(1.4.5) 

along with 2n — 2 continuity conditions 

S'i-i(ti) = S'iiU); S^iU) = S"(U) (i = l,...,n- 1). (1.4.6) 

* This application is somewhat specialized, and may be omitted at a first 
reading. 


12 


1 Introductory ideas and examples 


There are altogether 4n — 2 conditions to satisfy. We have n cubic polyno- 
mials to be determined, each of which has 4 coefficients, for a total of 4 n 
unknown parameters. Since the conditions are linear, such a spline S(x ) 
surely exists and we can expect it to have two free parameters. It is con- 
ventional to choose these so that ,S'(.x) has a point of inflection at to and at 
tn • 

Now here is the solution. The functions Sj(x) are given by 

Si(x) = ■^(z i (t i+ 1 - x ) 3 + z i+1 (x - ti) 3 + (6y i+1 - h 2 z i+1 )(x - U) 

+ (6 yi - h 2 Zi){t i+ 1 -x)) (i = 0, 1, . . . ,n - 1), 

(1.4.7) 

provided that the numbers z i, , z n - 1 satisfy the simultaneous equations 


Zi—i + 4zi + z. 


i + 1 


6 

h 2 


(y*+i ~ tyi + Vi-i) (i = 1,2, ■ • • ,n - 1) (1.4.8) 


in which zq = z n = 0. It is easy t o check this, by sub stituting x = ti and 
x = t ^ |_i into (1.4.7) to verify that (1.4.5) and (1.4.6) are satisfied. Hence 
it remains only to solve the equations (1.4.8). 


The system of eq uations (1.4.8) is of the form (1.4.1), 
find the solutions from 


(1.4.3), (1.4.4) To do this, begin with the given set 
of points through which we wish to interpolate. Use them to 


hence we can 


write down 


6 


71—1 




(1.4.9) 


i=i 


Since (a, b , c) = (1,4, 1) in this example, we h ave r+ = = —2 ± y/3. Now our 
unknown generating function U(x) is given by (1.4.3), which reads as 


U(x) = 


x(D(x) + Zi + z n —\x n ) 
(1 + 4x + x 2 ) 


(1.4.10) 


in which the unknown numbers Z\, z n -i are determined by the requirement 


that the r ight sic 


equations (1.4.4) 


e of (1.4.10) be a polynomial, or equivalently by the two 
which become 


Zi + (V 3 - 2) n z n _ 1 = -D{V 3 - 2) 

z\ + (-V3 ~ 2) n z n —i = -D(-V3 - 2 ). 


When we know U(x), which is, after all, YVi=i z i xl ■> we can read off its 
coefficients to find the z’s, and use them in (1.4.7) to hnd the interpolating 
spline. 
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Example. 

Now let’s try an example with real live numbers in it. Suppose we are 
trying to fit the powers of 2 by a cubic spline on the interval [0, 5h Our 
input data are yi = 2* for i = 0, 1, . . . , 5, h = 1, and n = 5. From 


Then we solve 


we find that D(x) = 6x(l + 2x + 4a; 2 + 8x 3 ). 
that Zi = 204/209 and = 2370/209. Next (1.4.10) 


(1.4.11) 


tells us that 


(1.4.9) 


to find 


204 x 438 x 2 552 x 3 2370 x 4 
209 + 209 + 209 + 209 


and now we know all of the z£ s. 
polynomials that form the spline, 
subinterval [0, 1], is 


Finally, 


(1.4.7) 


tells us the exact cubic 


For example, £>o(x), which lives on the 


Sq(x) 


4 175 

1 T x T 

209 


34 


209 


Note that So(0) = 1 and So(l) = 2, so it correctly fits the data at the 
endpoints of its subinterval, and that S'q(O) = 0, so the fit will have an 
inflection point at the origin. The reader is invited to find all of the S/(x) 
(i = 0, 1, ... ,5), in this example, and check that they smoothly fit into each 
other at the points 1,2, 3, 4, in the sense that the functions and their first 
two derivatives are continuous. ■ 


One reason why you might like to fit some numerical data with a 
spline is beca use you w ant to integrate the function that the data represent. 


Integration of (1.4.7) from T; = ih to 1 = (i + 1 )h shows that 



Si(x)dx 


h h 3 

lj\Vi + Vi+ 1 ) -^(h + z i+ 1 ) 


(1.4.12) 


Thus, fitting some data by a spline and integrating the spline amounts to 
numerical integration by the trapezoidal rule with a third order correction 
term. If we sum (1.4.12) over i — 0, . . . , n~ 1 we get for the overall integral, 


i-nh 


S(x)dx = trap - ^( y 0 


h 3 

y 1 -y n -i +y n ) ~ ^ ( z i + z n -i) (1.4.13) 


in which ‘trap’ is the trapezoidal rule, and z\,z n -\ satisfy (1.4.11) 

Interpolation by spline functions is an important subject. It occurs in 
the storage of computer fonts, such as the one that you are now reading. 
Did you ever wonder how the shapes of the letters in the fonts are actually 
stored in a computer? One way is by storing the parameters of spline 
functions that fit the contours of the letters in the font. 
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1.5 Two independent variables 

In this section we will see how generating functions can be helpful in 
problems that involve functions of two discrete variables. We will use the 
opportunity also to introduce the binomial coefficients, since they are surely 
one of the most important combinatorial counting sequences. 

Let n and k be integers such that 0 < k < n. In how many ways can 
we choose a subset of k objects from the set {1,2, . . . , n}? Let’s pretend 
that we don’t know how this will turn out, and allow generating functions 
to help us find the answer. 

Suppose /(n, k ) is the answer to the question. We imagine that the 
collection of all possible subsets of k of these n objects are in front of us, 
and we will divide them into two piles. In the first pile we put all of those 
subsets that do contain the object ‘n’, and into the second pile we put all 
subsets that do not contain ‘n’. The first of these piles obviously contains 
f(n — 1, k — 1) subsets. The second pile contains f(n — 1, k) subsets. The 
two piles together originally contained f(n,k) subsets. So it must be that 
our unknown numbers /(n, k) satisfy the recurrence 

f(n,k) = f{n- l,k) + f(n- l,k- 1) (/(n, 0) = 1). (1.5.1) 

To find formulas for these numbers we use (what else?) generating 
functions. For each n = 0, 1, 2, . . . define the generating function 

B n (x ) = J ~2f{n,k)x k . 

k> 0 

Now multiply (1.5.1) throughout by x k and sum over k > 1 . The result is 
that B n {x) — 1 = ( B n -i(x ) — 1) + xB n -i(x), for n > 1, with Bq(x) = 1. 
Hence 

B n (x) = (1 + x)B n - 1 (x ) (n > 1; B 0 (x ) = 1). (1.5.2) 

Thus B n (x ) = (1 + x) n . for all n > 0. Our unknown number f(n, k ) 
is revealed to be the coefficient of x k in the polynomial (1 +x) n . To find 
a formula for /(n, k ) we might, for example, use Taylor’s formula, which 
would tell us that f{n,k) is the kth derivative of (1 + x) n , evaluated at- 
x — 0, all divided by k\. The differentiation is simple to do. Indeed, the 
kth derivative of (1 + x) n is n(n — l ) ■ ■ ■ (n — k + 1)(1 + x) n ~ k . If we put- 
x — 0 and divide by k\ we quickly discover that f(n. k). the number of 
A’-subsets of n things, is given by 

f n\ n\ n{n — l)(n — 2) • • • (n — k + 1) 

\k ) = k\(n-k)\ = k\ 


for integers n, k with 0 < k < n. 


(1.5.3) 
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That pretty well takes care of the binomial coefficients Q) when 0 < 
k < n and n, k are integers. When k is a negative integer the binomial 
coefficient Q) =0. 

Although the second member of equation (1.5.3) is difficult to decipher 
if n is not a nonnegative integer, the third member isn’t a bit hard to 
understand, even if n is a complex number, so long as k is a nonnegative 
integer. So that gives us an extension of the definition of the binomial 
coefficients to arbitrary complex numbers n, namely, 


f n\ n{n — l)(n — 2) • • • (n — & + 1) 

U/ = k! 


(integer k > 0). (1.5.4) 


Thus ( 3 3 ) = (— 3)(— 4)(— 5) / 6 = —10, and (*) = i{i — l)/2 = (—1 — 
etc. The generating function 

B n (x) = (1 T x) n = ^2 C^) xk = 1 + nx + ^ x 2 H 


0 / 2 , 


remains valid for all complex numbers n: the series terminates if n is a 
nonnegative integer, and it converges for \x\ < 1 in any case. 

What is the support of ()))? That is, for which values of n,k is it 
true that ()') 0? First, k must be a nonnegative integer. If n is not 

a nonnegative integer then ((') is surely nonzero, by (1.5.4). If n is a 
nonnegative integer then (1.5.4) shows that ())) ^ 0 iff 0 < k < n (in 
accordance with the combinatorial definition!). 

A very important consequence of these facts is that if n is a nonnegative 
integer, instead of writing something like X]fc=o we can e Q ua ^y well 

write YlkL-oo because the binomial coefficients vanish on all of the 

seemingly extra values of k that appear in the second form of the sum. The 
binomial coefficients “cut off” the sum by themselves, so there is no need 
to do it again with the range of summation. 

That being the case, we introduce two conventions that we will adhere 
to throughout the book. 

Convention 1. When the range of a variable that is being summed over 
is not specified, it is to be understood that the range of summation is from 
— oo to Too. 

Convention 2. When the range of a free variable in an equation is not 
specified, it is to be understood that the equation holds for all integer values 
of that variable. 

For example, we will write (2)x k — (1 T x) n . These conventions 
will save us an enormous amount of work in the sequel, mainly in that 
we won’t have to worry about changing the limits of summation when we 
change the variable of summation by a constant shift. 
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Let’s look at generating functions of some other kinds. If we multiply 
B n (x) by y" and sum only over n > 0, we find that 


£ Bn(x)y n = Y, £ Ct ) x ‘ iyn = £ (1 + x)ny 

n> 0 n>0 k ' n> 0 


1 

1 -y(l +x)' 


Thus for integer n > 0, Q) = [x k y n ]( 1 — y( 1 + a;)) -1 . 

For another exercise, let’s evaluate, for nonnegative integer k, the sum 
Y n ( fc)l/ n • Note that the index n runs over all integers, but that the sum- 
mand vanishes unless n > k. That sum is clearly 



\ x k ] t. = \ x k ] t. 

1 l l~y( l + x) 1 -y [ 

i ( y \ k = y k 

l-y'l-y* (1 ~y) k+1 ' 


For future reference we will place side-by-side these two power series gen- 
erating functions of the binomial coefficients: 


s(XH = (1+a;) 



(1 - y) k+1 ' 


(1.5.5) 


1.6 Another 2- variable case. 

This example will have a stronger combinatorial flavor than the pre- 
ceding ones. It concerns the partitions of a set. By a partition of a set S 
we will mean a collection of nonempty, pairwise disjoint sets whose union is 
S. Another name for a partition of S is an equivalence relation on S. The 
sets into which S is partitioned are called the classes of the partition. 

For instance, we can partition [5]* in several ways. One of them is as 
{123}{4}{5}. In this partition there are three classes, one of which contains 
1 and 2 and 3, another of which contains only 4, while the other contains 
only 5. No significance attaches to the order of the elements within the 
classes, nor to the order of the classes. All that matters is ‘who is together 
and who is apart-.’ 

Here is a list of all of the partitions of [4] into 2 classes: 

{12}{34}; {13}{24}; {14}{23}; {123}{4}; {124}{3}| {134}{2} ; {1}{234}. 

( 1 . 6 . 1 ) 

There are exactly 7 partitions of [4] into 2 classes. 

The problem that we will address in this example is to discover how 
many partitions of [n\ into k classes there are. Let denote this number. 


Recall that [n] is the set {1,2,..., n} 
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It is called the Stirling number of the second kind. Our list above shows 
that { 2 } = 7. 

To find out more about these numbers we will follow the method of 
generating functions. First we will find a recurrence relation, then a few 
generating functions, then some exact formulas, etc. 

We begin with a recurrence formula for {))}, and the derivation will 
be quite similar to the one used in the previous example. 

Let positive integers n, k be given. Imagine that in front of you is the 
collection of all possible partitions of [n] into k classes. There are exactly 
{ " | of them. As in the binomial coefficient example, we will carve up this 
collection into two piles; into the first pile go all of those partitions of [n] 
into k classes in which the letter n lives in a class all by itself. Into the 
second pile go all other partitions, i.e., those in which the highest letter n 
lives in a class with other letters. 

The question is, how many partitions are there in each of these two 
piles (expressed in terms of the Stirling numbers)? 

Consider the first pile. There, every partition has n living alone. Imag- 
ine marching through that pile and erasing the class ; (n)' that appears in 
every single partition in the pile. If that were done, then what would re- 
main after the erasures is exactly the complete collection of all partitions 
of [n — 1] into k — 1 classes. There are of these, so there must have 


been partitions in the first pile. 

That was the easy one, but now consider the second pile. There the 
letter n always lives in a class with other letters. Therefore, if we march 
through that pile and erase the letter n wherever it appears, we won’t affect 
the numbers of classes; we’ll still be looking at partitions with k classes. 
After erasing the letter n from everything, our pile now contains partitions 
of n — 1 letters into k classes. However, each one of these partitions appears 
not just once, but several ti mes. 

For example, in the list (1.6.1) the second pile contains the partitions 


{ 12}{34}; { 13}{24}; {14}{23}; {124}{3}; {134}{2}; {1}{234}, (1.6.2) 


and after we delete ‘4’ from every one of them we get the list 


{ 12} {3} ; { 13}{2} ; {1}{23}; {12}{3}; {13}{2}; {1}{23}. 

What we are looking at is the list of all partitions of [3] into 2 classes, 
where each partition has been written down twice. Hence this list contains 
exactly 2 } | } partitions. 

In the general case, after erasing n from everything in the second pile, 
we will be looking at the list of all partitions of [n — 1] into k classes, where 
every such partition will have been written down k times. Hence that list 
will contain exactly A; { 77 1 } partitions. 
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Therefore the second pile must have also contained k { " f 1 } partitions 
before the erasure of n. 

The original list of partitions was therefore split into two piles, the 
first of which contained partitions and the second of which contained 

k^f 1 } partitions. 

It must therefore be true that 

CHriMV) 

To determine the range of n and k , let’s extend the definition of to all 
pairs of integers. We put { ' ' } = 0 if k > n or n < 0 or k < 0. Further, 
{ " } = 0 if n ^ 0, and we will take = 1. With those conventions, the 
recurrence above is valid for all (n, k ) other than (0, 0), and we have 

{i}-t:Mv} <>« 

The stage is now set for finding the generating functions. Again there 
are three natural candidates for generating functions that might be find- 
able, namely 

k v J 

B k( x ) = J2{ n k \ xn (1-6.4) 

C(x,y ) = 

Before we plunge into the calculations, let’s pause for a moment to 
develop some intuition about which of these choices is likely to succeed. To 
find A n (y ) will involve multiplying (1.6.3) by y k and summing over k. Do 
you see any problems with that? Well, there are some, and they arise from 
the factor of k in the second term on the right. Indeed, after multiplying by 
y k and summing over k we will have to deal with something like k\^y k . 
This is certainly possible to think about, since it is related to the derivative 
of A n (y ), but we do have a complication here. 

If instead we choose to find B k (x), we multiply (1.6.3) by x n and sum 
on n. Then the factor of k that seemed to be troublesome is not involved 
in the sum, and we can take that k outside of the sum as a multiplicative 
factor. 

Comparing these, let’s vote for the latter approach, and try to find the 
functions B k {x) (k > 0). Hence, multiply (1.6.3) by x n and sum over n, 
to get 

B k (x ) = xB k - i(x) + kxB k (x ) ( k > 1 ; B 0 (x) = 1). 
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This leads to 

Bk{x) = - *^ B k - i(x) (k > 1; B 0 (x) = 1) 

and finally to the evaluation 

Bk( x ) = 5^ {J 1 " = (1 _ x )(i _ 2x)(l - 3x) • • ■ (1 - kx) 

(1.6.5) 

The problem of finding an explicit formula for the Stirling numbers 
could therefore be solved if we could find the power series expansion of the 
function that appears in (1.6.5). That, in turn, calls for a dose of partial 
fractions, not of Taylor’s formula! 

The partial fraction expansion in question has the form 

1 a j 

(1 — x)(l — 2x) •••(! — kx) (1 — jx ) ' 


To find the a's. fix r, 1 < r < k, multiply both sides by 1 — rx , and let 
x = 1/r. The result is that 

1 


a r 


(1 — l/r)(l — 2/r) • • • (1 — (r — l)/r)(l — (r + 1 )/r) ■ 

r k— 1 

k — r ' 


= (-l) 


(r — 1)1 (k — r)\ 


(1 <r<k). 


From (1.6.5) and (1.6.6) we obtain, for n > k, 

x 



r nl 1 

l*J 



= \x 


n—k ' 


(1 — x)(l — 2x) •••(! — kx) 

1 

(1 — x)(l — 2x) •••(! — kx) 


= \x 


n—k' 


r=l 


a T 


rx 


(k> 1 ) 


= > a r \x 


E 

r — 1 
k 

r— 1 
k 

D-D 


n — fc i 


1 — rx 


n—k 


k — r 


,k - 1 


„n—k 


r = 1 
k 


(r — l)!(fc — r)! 


D-1) 


k—r 


r — 1 


r!(/c — r)! 


(n, k > 0), 


(1 — fc/r) 

( 1 . 6 . 6 ) 


( 1 . 6 . 7 ) 
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which is just what we wanted: an explicit formula for { " j . Do check that 
this formula yields { } = 7, which we knew already. At the same time, 
note that the formula says that { ” } = 2 n_1 — 1 (n > 0). Can you give an 
independent proof of that fact? 

Next, we’re going to try one of the other approaches to solving the 
recurren ce for th e Stirling numbers, namely that of studying the functions 
A n (y) in (1.6.4) This method is much harder to carry out to completion 
than the one we just used, but it turns out that these generating functions 
have other uses that are quite importa nt from a theoretical point of view. 

Therefore, let’s fix n > 0, multiply (1.6.3) by y k and sum over k. The 
result is 


A n (y) = 


JU v J h. 


n — 1 


y 


d 


= yAn-xiy) + {y— )A n - 1 (y) 
dy 

= {y( 1 + DyfiAn-M (n > 0; Ao(y) = 1). 


( 1 . 6 . 8 ) 


The novel feature is the appearance of the differentiation operator d/dy 
that was necessitated by the factor k in the recurrence relation. 

Hence each function A n is obtained from its predecessor by applying 
the operator y(l + D y ). Beginning with Aq = 1, we obtain successively 
y, y + y 2 , y + 3 y 2 + y 3 , . . .. but as far as an explicit formula is concerned, we 
find only that 

A n (y) = {y + yD y } n l (n > 0) (1.6.9) 


by this approach. 

There are, however, one or two things that can be seen more clearly 
from these gen erating func tions than from the B n (x)’s. One of these will 


be discussed in section 4.5 and concerns the shape of the sequence { )) } for 
fixed n. as k runs from 1 to n. It turns out that the sequence increases for 
a while and then it decreases. That is, it has just one maximum. Many 
combinatorial sequences are unimodal , like this one, but in some cases it 
can be very hard to prove such things. In this case, thanks to the formula 
(1.6.9), we will see that i t’s not h ard at all. 

For an application of (1.6.7), recall that the Stirling number | j is the 
number of ways of partitioning a set of n elements into k classes. Suppose 
we don’t particularly care how many classes there are, but we want to know 
the number of ways to partition a set of n elements. Let these numbers 
be {6(n)}o°. They are called the Bell numbers. It is conventional to take 
6(0) = 1. The sequence of Bell numbers begins 1, 1, 2, 5, 15, 52, . . . 

Can we find an explicit formula for the Bell numbers? Nothing to it. 
.6.7) we have an explicit formula for | ” } . If we sum that formula from 
k = 1 to n we will have an explicit formula for b(n). However , there’s one 


In 
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more thing that it is quite profitable to notice. The formula (1.6.7) 


is valid 


for all positive integer values of n and k. In partic ular, it is valid if k > 


n. 


But {;} = o it * > n. This means that the formula) (1.6.7) 


doesn’t have to 

be told that = 0; it knows it; i.e., if we blissfully insert n = 13, k = 19 
into the monster sum and work it all out, we will get 0. 

He nce, to calculate the Bell numbers, we can sum the last member of 


(1.6.7) from k = 1 to M, where M is any number you please that is > n. 


Let’s do it. The result is that 


M k 

b(n ) 


\k—r 


„n— 1 


k= 1 r = 1 

(r — 1 )!(/c 

^ n— 1 

V 

M k- r 

fri (r- 1)1 

( k — r)\ 

r n- 1 

f V (-!) S 

( r ~ !) ! 

\h *'■ 


But now the number M is arbitrary, except that M > n. Since the partial 
sum of the exponential series in the curly braces above is so inviting, let’s 
keep n and r fixed, and let M — > oo. This gives the following remarkable 
formula for the Bell numbers (check it yourself for n = 1): 

1 x a T n 

b(n) = -y — (n > 0). (1.6.10) 

e z J r\ 

r> o 

This formula for the Bell numbers, although it has a certain charm, 
doesn’t lend itself to computation. From it, however, we can derive a 
generating function for the Bell numbers that is unexpectedly simple and 
elegant. We will look for the generating function in the form 

B(x) = J2^T- xn - ( 1 . 6 . 11 ) 

n> 0 


This is the first time we have found it necessary to introduce an extra 
factor of 1/n! into the coefficients of a generating function. That kind of 
thing happens frequently, however, and we will discuss in chapter 2 how to 


recognize when extra factors like these will be useful. A generating function 
of the form (1.6.11), with the l/n!’s thrown into the coefficients, is called 
an exponential generating function. We would say, for instance, that l B(x) 
is the exponential generating function of the Bell numbers.’ 

When we wish to distinguish the various kinds of generating functions, 
we may use the phrase the ordinary power series generating function of 
the sequence {a n } is Yh n a n xTl or the exponential generating function of the 
sequence {a n } is ^) n o n x n /n!. 
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To find B(x) explicitly, take the formula (1.6.10) which is valid for 
n > 1, multiply it by x n /nl (don’t forget the n!), and sum over all n > 1. 
This gives 


B(x ) — 1 = 


1 rr n r>n — l 

, 1 . ^ — A Ju ^ — A / 

e n\ (r — 1)! 

n> 1 r>l v ’ 


1 ( rx ) 

e r! n\ 

r>l n> 1 


■ e • ^ — / r ! 

r>l 

1 

= (-){e e ~e} 
= e eX_1 — 1. 


We have therefore shown 


Theorem 1.6.1. The exponential generating function of the Bell numbers 
is e e -1 , i.e., the coefficient of x n /n\ in the power series expansion of e e -1 
is the number of partitions of a set of n elements. 

This result is surely an outstanding example of the power of the gen- 
erating function approach. The Bell numbers themselves are complicated, 
but the generating function is simple and easy to remember. 

The next novel element of this story is the fact that we can go from 
generating functions to recurrence formulas, although in all our examples 
to date the motion has been in the other direction. We propose now to 
derive from Theorem 1.6.1 a recurrence formula for the Bell numbers, one 
that will make it easy to compute as many of them as we might wish to 
look at. 

First, the theorem tells us that 


E 

n> 0 


b(n ) 

71 \ 


= e 


e x — 1 


(1.6.12) 


We are going to carry out a very standard operation on this equation, but 
the first time this operation appears it seems to be anything but standard. 


The x(d/dx) log operation 

(1) Take the logarithm of both sides of the equation. 

(2) Differentiate both sides and multiply through by x. 

(3) Clear the equation of fractions. 

(4) For each n, find the coefficients of x n on both sides of the equation 
and equate them. 

Although the best motivation for the above program is the fact that 
it works, let’s pause for a moment before doing it, to see why it is likely to 


1.6 Another 2-variable case. 
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work. The point of taking logarithms is to simplify the function e e _1 , 
whose power series coefficients are quite mysterious before taking loga- 
rithms, and are quite obvious after doing so. The price for that simpli- 
fication is that on the left side we have the log of a sum, which is an 
awesome thing to have. The next step, the differentiation, changes the log 
of the sum into a ratio of two sums, which is much nicer. The reason for 
multiplying through by x is that the differentiation dropped the power of x 
by 1 and it’s handy to restore it. After clearing of fractions we will simply 
be looking at two sums that are equal to each other, and the work will be 
over. 

we have 


In this case, after step 1 is applied to (1.6.12) 


''n>0 


-x n I = e x 


1 . 


Step 2 gives 


E r 


nb(n)x r ‘ 


Er 


b(n)x r ‘ 


xe 


To clear of fractions, multiply both sides by the denominator on the left, 
obtaining 


E 


nb{n)x r 

n\ 


xe 


E 


b(n)x r 

n\ 


Finally, we have to identify the coefficients of x n on both sides of this 
equation. On the left it’s easy. On the right we have to multi ply two pow er 
series together first, and then identify the coefficient. Since in chapter 2 we 


will work out a general and quite easy-to-use rule for doing things like this, 
let’s postpone this calculation until then, and merely quote the result here. 
It is that the Bell numbers satisfy the recurrence 


b H = 


n 


1 


b(k) ( n > 1; 6(0) = 1). 


(1.6.13) 


We have now seen several examples of how generating functions can 
be used to find recurrence relations. It often happens that the method of 
generating functions finds a recurrence, and only later are we able to give 
a direct, combinatorial interpretation of the recurrence. In some cases, re- 
currences are known that look like they ought to have simple combinatorial 
explanations, but none have yet been found. 
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Exercises 

1. Find the ordinary power series generating functions of each of the follow- 
ing sequences, in simple, closed form. In each case the sequence is defined 
for all n > 0 . 

(a) a n = n 

(b) a n = an + (5 

(c) a n = n 2 

(d) a n = an 2 + (3n + 7 

(e) a n = P{n), where P is a given polynomial, of degree m. 

(f) an = 3 n 

( g ) a n = 5 • 7 n — 3 • 4 n 

2. For each of the sequences given in part 1, find the exponential generating 
function of the sequence in simple, closed form. 

3. If f(x) is the ordinary power series generating function of the sequence 
{a n }n> 0 , then express simply, in terms of f(x), the ordinary power series 
generating functions of the following sequences. In each case the range of 
n is 0 . 1 . 2 ,... 

(a) {a n + c } 

(b) {aa n + c} 

(c) {na n } 

(d) {P(n)a n }, where P is a given polynomial. 

(e) 0 , ai, 02 , a 3 , . . . 

(f) 0 , 0 , 1 , < 23 , a. 4 , < 25 , . . . 

(g) a 0 , 0 , o 2 , 0 , a 4 , 0 , a 6 , 0 , a 8 , 0 , . . . 

(h) ai,a 2 ,a 3 ,... 

(i) {a n +h} (h a given constant) 

(j) { a n + 2 + 3o n+ i + a n } 

(k) {a n ~ |-2 a n -\- 1 a n ,y 

4. Let f(pc ) be the exponential generating function of a sequence {a n }. For 
each of the sequences in exercise 3, find the exponential generating function 
simply, in terms of f(x). 



Exercises 


25 


5. Find 

(a) [x n ]e 2x 

(b) [x n /n\]e ax 

(c) [x n /n!] since 

(d) [x n ]{l/((l — ax)(l — bx))} (a ^ b) 

(e) [x n ](l + x 2 ) m 

6. In each part, a sequence {o n } n > o satisfies the given recurrence relation. 
Find the ordinary power series generating function of the sequence. 


(a) CL n - |_i 3 CL n “I - 2 

(b) a n+ 1 = aa n + (3 

(c) a n - 1-2 = 2a n+ i i 

(d) a n+ 1 = a n /3 + 1 


(n > 0; ao = 0) 

(n > 0; ao = 0) 

(n > 0; ao = 0; ai = 1) 
(n > 0; ao = 0) 


7. Give a direct combinatorial proof of the recurrence (1.6.13), as follows: 
given n; consider the collection of all partitions of the set [nj. There are 
b(n) of them. Sort out this collection into piles numbered k = 0, 1, . . . , n — 1, 
where the Tcth pile consists of all partitions of [n] in which the class that 
contains the letter ‘n’ contains exactly k other letters. Count the partitions 
in the kth pile, and you’ll be all finished. 


8. In each part of problem 6, find the exponential generating function of 
the sequence (you may have to solve a differential equation to do so!). 

9. A function / is defined for all n > 1 by the relations (a) /( 1) = 1 and 
(b) f(2n) = f(n) and (c) f(2n + 1) = f(n) + f(n + 1). Let 


F ( x ) = f(n)x n 1 

n> 1 


be the generating function of the sequence. Show that 

F(x) = (1 + x + x 2 )F(x 2 ), 


and therefore that 


F{x) = jl + x 2,3 + x 2l+ | . 

3 > 0 


10. Let X be a random variable that takes the values 0, 1, 2, . . . with re- 
spective probabilities pQ.p 1 .p 2 - . . ., where the p's are given nonnegative real 
numbers whose sum is 1. Let P(x) be the opsgf of {p n }- 

(a) Express the mean p and standard deviation a of X directly in terms 
of P{x). 
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(b) Two values of X are sampled independently. What- is the probability 
Pn ; t-hat- their sum is n? Express the opsgf P 2 (x ) of {p n '} in terms 
of P(x). 

(c) k values of X are sampled independently. Let p'n' ! be the probability 

that their sum is equal to n. Express the opsgf Pf, (x) of {pi fc) } n > o 
in terms of P{x). 

(d) Use the results of parts (a) and (c) to find the mean and standard 
deviation of the sum of k independently chosen values of X , in terms 
of // and a. 

(e) Let A(x ) be a power series with A(0) = 1, and let B(x ) = A{x) k . It 
is desired to compute the coefficients of B(x), without raising A(x) 
to any powers at all. Use the hD log 1 method to derive a recurrence 
formula that is satisfied by the coefficients of B(x). 

(f) A loaded die has probabilities .1, .2, .1, .2, .2, .2 of turning up 
with, respectively, 1, 2, 3, 4, 5, or 6 spots showing. The die is then 
thrown 100 times, and we want to calculate the probability p * that 
the total number of spots on all 100 throws is < 300. Identify p* 
as the coefficient of .x 300 in the power series expansion of a certain 
function. Say exactly what the function is (you are not being asked 
to calculate p*). Use the result of part (e) to say exactly how you 
would calculate p* if you had to. 

(g) A random variable X assumes each of the values 1,2, ...,m with 
probability 1/m. Let S n be the result of sampling n values of X 
independently and summing them. Show that for n = 1,2,..., 

Prob {■?„<;} = T £ ( _i r 

r 



11. Let f{n) be the number of subsets of [n] that contain no two consecu- 
tive elements, for integer n. Find the recurrence that is satisfied by these 
numbers, and then ‘find’ the numbers themselves. 

12. For given integers n, A-, let f(n, k ) be the number of A’-subsets of [n] t-hat- 
cont-ain no two consecutive elements. Find the recurrence that is satisfied 
by these numbers, find a suitable generating function and find the num- 
bers themselves. Show the numerical values of /(n, k) in a Pascal triangle 
arrangement, for n < 6. 

13. By comparing the results of the above two problems, deduce an identity. 
Draw a picture of the elements of Pascal’s triangle that are involved in this 
identity. 

14. Let the integers 1,2, ... ,n be arranged consecutively around a circle, 
and let g(n ) be the number of ways of choosing a subset of these, no two 



Exercises 


27 


consecutive on the circle. That is, g differs from the / of problem 11 in 
that n and 1 are now regarded as consecutive. Find g(n). 


15. As in the previous problem, find, analogously to problem 12 above, the 
number g(n , k) of ways of choosing k elements from n arranged on a circle, 
such that no two chosen elements are adjacent on the circle. 


16. Find the coefficient of x n in the power series for 


ffr) 


1 

(1-rr 2 ) 2 ’ 


first by the method of partial fractions, and second, give a much simpler 
derivation by being sneaky. 

17. An inversion of a permutation a of [n] is a pair of letters i. j such that 
i < j and a{i) > a (j). In the 2-line form of writing the permutation, an 
inversion shows up as a pair that is ‘in the wrong order’ in the second line. 
The permutation 

_/l23456789\ 

^4 9 2 5 8 1 6 7 3 J 

of [9] has 19 inversions, namely the pairs (4,2), (4,1), ..., (7,3). Let b(n,k) 
be the number of permutations of n letters that have exactly k inversions. 
Find a ‘simple’ formula for the generating function B n (x) = 6(n, k)x k . 

Make a table of values of 6(n, k) for n < 5. 

18. 

(a) Given n 1 k. For how many of the permutations of n letters is 
it true that their first k values decrease? 

(b) What is the average length of the decreasing sequence with 
which the values of a random n-permutation begin? 

(c) If /(n, k) is the number of permutations that have exactly k 
ascending runs, find the Pascal-triangle-type recurrence sat- 
isfied by f(n, k ). They are called the Euler numbers. As an 
example, the permutation 

/l 2345678 9\ 

\ 4 1 6 9 2 5 8 3 7 J 

has 4 such runs, namely 4, 1 6 9, 2 5 8, and 3 7. 

19. Consider the 256 possible sums of the form 


6 i + 62 + 263 + 564 T IO 65 + 1066 T 2067 T 506s (1) 


where each e is 0 or 1. 
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(a) For each integer n. let C n be the number of different sums t-hat- 
represent n. Write the generating polynomial 


Co + CiX + C 2 X 2 + C 3 X 3 H + CggX 99 


as a product. 

(b) Next, consider all of the possible sums that are formed as in (1) 
where now the e’s can have any of the three values —1, 0, 1. For each 
integer n, let D n be the number of different sums that represent n. 
Show that some integer n is representable in at least 33 different 
ways. Then write the generating function 


99 

E 

n= — 99 


D n x n 


as a product. 

(c) Generalize the results of parts (a) and (b) of this problem by re- 
placing the particular set of weights by a general set. Factor the 
polynomial that occurs. 

(d) In the general case of part (d) of this problem, state precisely what 
all of the zeros of the generating polynomial are, and state precisely 
what the multiplicity of each of the zeros is, in terms of the set of 
weights. 

20. Let f(n,m,k) be the number of strings of n 0’s and l’s that contain 
exactly m l’s, no k of which are consecutive. 

(a) Find a recurrence formula for /. It should have /(n, m, k ) 
on the left side, and exactly three terms on the right. 

(b) Find, in simple closed form, the generating functions 

F k(x,y)= Y f(n,m,k)x n y m {k = 1,2,...). 

n,m> 0 


(c) Find an explicit formula for f(n, m , k) from the generat- 
ing function (this should involve only a single summation, 
of an expression that involves a few factorials). 

21 . 

(a) We want to find a formula for the nth derivative of the function 
e e . Differentiate it a few times, study the pattern, and conjecture 
the form of the answer for general n, including some constants to 
be determined. Then find a recurrence formula for the constants 
in question, and identify them as some ‘famous’ numbers that we 
have studied. 
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(b) Next let f(x\, . . . , x n ) be some function of n variables. Find a 
formula for the mixed partial derivative 


d n 

dx\dx2 ■ ■ ■ dx n 


e 


f 


that expresses it in terms of various partial derivatives of / itself. 
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Chapter 2 
Series 

This chapter is devoted to a study of the different kinds of series t-hat- 
are widely used as generating functions. 

2.1 Formal power series 

To discuss the formal theory of power series, as opposed to their an- 
alytic theory, is to discuss these series as purely algebraic objects, in their 
roles as clotheslines, without using any of the function-theoretic properties 
of the function that may be represented by the series or, indeed, without- 
knowing whether such a function exists. 

We study formal series because it often happens in the theory of gen- 
erating functions that we are trying to solve a recurrence relation, so we 
introduce a generating function, and then we go through the various ma- 
nipulations that follow, but with a guilty conscience because we aren’t sure 
whether the various series that we’re working with will converge. Also, we 
might find ourselves working with the derivatives of a generating function, 
still without having any idea if the series converges to a function at all. 

The point of this section is that there’s no need for the guilt, because 
the various manipulations can be carried out in the ring of formal power 
series, where questions of convergence are nonexistent. We may execute 
the whole method and end up with the generating series, and only then 
discover whether it converges and thereby represents a real honest function 
or not. If not, we may still get lots of information from the formal series, 
but maybe we won’t be able to get analytic information, such as asymptotic 
formulas for the sizes of the coefficients. Exact formulas for the sequences 
in question, however, might very well still result, even though the method 
rests, in those cases, on a purely algebraic, formal foundation. 

The series 

f = l + x + 2x 2 -t- 6x 3 + 24x 4 + 120a: 5 + •■■-(- n!ir n + • • • , (2. IT) 

for instance, has a perfectly fine existence as a formal power series, despite 
the fact that it converges for no value of x other than x = 0, and therefore 
offers no possibilities for investigation by analytic methods. Not only that, 
but this series plays an important role in some natural counting problems. 

A formal power series is an expression of the form 

a-o + a\x + a 2 X 2 + • • • 

where the sequence {a n }g° is called the sequence of coefficients. To say that 
two series are equal is to say that their coefficient sequences are the same. 
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We can do certain kinds of operations with formal power series. We 
can add or subtract them, for example. This is done according to the rules 

E 0 ^ n ± E = E^° n ± b n)x n . 

n n n 

Power series can be multiplied by the usual Cauchy product rule, 

E a nX n E b nX n = E C n xU ( c n = E a k b n-k) • ( 2 - 1 . 2 ) 

n n n k 

It is certainly this product rule that accounts for the wide applicability of 
series methods in combinatorial problems. This is because frequently we 
can construct all a n of the objects of type n in some family by choosing an 
object of type k and an object of type n — k and stitching them together 
to make the object of type n. The number of ways of doing that will be 
afca n _fc, and if we sum on k we find that the Cauchy product of two formal 
series is directly relevant to the problem that we are studying. 

If we follow the multiplication rule we obtain, for instance, 

(1 — x)(l + x + x 2 + x 3 + • • •) = 1 . 

Thus we can say that the series (1 — x) has a reciprocal, and that reciprocal 
is 1 + x + x 2 + ■ ■ ■ (and the other way around, too). 

Proposition. A formal power series f = X^n>o a r> x " has a reciprocal if 
and only if oo ^ 0. In that case the reciprocal is unique. 

Proof. Let / have a reciprocal, namely 1// = ^ n>0 b n % n • Then /-(l//) = 
1 and according to (2.1.2), cq = 1 = aobo, so oo ^ 0. Further, in this case 

( 2 . 1 . 2 ) tells us that for n > 1 , c n = 0 = a k b n-k- from which we find 

b n = (— 1/ a 0 ) E a kK-k (n > 1). (2.1.3) 

fc> l 

This determines bi, 62 , . . . uniquely, as claimed. 

Conversely, suppose a 0 yf 0. Then we can determine 60 , 61 ,... from 

(2.1.3) , and the resulting series b n x n is the reciprocal of /. >g| 

The collection of formal power series under the rules of arithmetic that 
we have just described forms a ring, in which the invertible elements are 
the series with nonvanishing constant term. 

The above idea of a reciprocal of a formal power series is not to be 
confused with the subtler notion of the inverse of such a series. The inverse 
of a series /, if it exists, is a series g such that f{g(x)) = g{f(x)) = x. 
When can such an inverse exist? First we need to be able to define the 
symbol f{g(x)), then we can worry about whether or not it is equal to x. 
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If / = J2 n a n.x n , then f(g(x )) means 

f(g(x)) = ^2a n g(x) n . (2.1.4) 

n 

If the series g(x) has a nonzero constant term, go, then every term of the 
series (2.1.4) may contribute to the coefficient of each power of x. On the 
other hand, if go = 0, then we will be able to compute the coefficient of, 
say, x 57 in (2.1.4) from just the first 58 terms of the series shown. Indeed, 
notice that every single term 

a n g(x) n = a n (gix + g 2 x 2 + . . .) n 

= a n x n \gi + g 2 x + .. .) n 

with n > 57 will contain only powers of x higher than the 57th, and there- 
fore we won’t need to look at those terms to find the coefficient of x 57 . 

Thus if go = 0 then the computation of each one of the coefficients of 
the series f{g(x)) is a finite process, and therefore all of those coefficients 
are well defined, and so is the series. If go 0, though, the computation 
of each coefficient of f{g(x)) is an infinite process unless / is a polynomial, 
and therefore it will make sense only if the series ‘converge.’ In a formal, 
algebraic theory, however, ideas of convergence have no place. Thus the 
composition f(g(x)) of two formal power series is defined if and only if 
go = 0 or f is a polynomial. 

For instance, the series e e _1 is a well defined formal series, whereas the 
series e e is not defined, at least from the general definition of composition 
of functions. 

To return to the question of finding a series inverse of a given series /, 
we see that if such an inverse series g exists, then 

f(g(x)) = g(f(x)) = x (2.1.5) 

must both make sense and be true. We claim that if /( 0) = 0 the inverse 
series exists if and only if the coefficient of x is nonzero in the series /. 

Proposition. Let the formal power series f, g satisfy (2.1.5) and /( 0) = 0. 
Then f = f\x + f 2 x 2 4 (fa 0), and g = g x x + g 2 x 2 h ( gi ^0). 

Proof. Suppose that / = f r x r 4 — ■ and g = g s x s 4 , where r, s > 0 and 

f r g s 0. Then f{g(x)) = x — f r g r s x ra 4 , whence rs = 1, and r = s = 1, 

as claimed. ■ 

In the ring of formal power series there are other operations defined, 
which mirror the corresponding operations of function calculus, but which 
make no use of limiting operations. 

The derivative of the formal power series / = a n x n is the series 
f = J2 n na n x n ~ 1 . Differentiation follows the usual rules of calculus, such 
as the sum, product, and quotient rules. Many of these properties are even 
easier to prove for formal series than they are for the functions of calculus. 
For example: 
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Proposition. If f = 0 then f = oo is constant. 

Proof. Take another look at the ‘=’ sign in the hypothesis f = 0. It means 
that the formal power series f is identical to the formal power series 0, and 
that means that each and every coefficient of the formal series f is 0. But- 
t.he coefficients of f are aq, 2 a. 2 , 3a3, . . so each of these is 0, and therefore 
a,j = 0 for all j >1, which is to say that / is constant. |j 

Next, try this one: 

Proposition. If f = f then f = ce x . 

Proof. Since f = /, the coefficient of x n must be the same in / as 
in /', for all n > 0. Hence (n + l)a n+ i = a n for all n > 0, whence 
a n - |_i = a n f(n + 1) (n > 0). By induction on n, a n = ao/n\ for all n > 0, 
and so / = aoe x . | 


2.2 The calculus of formal ordinary power series generating func- 
tions 

Operations on formal series involve corresponding operations on their 
coefficients. If the series actually converge and represent functions, then 
operations on those functions correspond to certain operations on the power 
series coefficients of the expansions of those functions. In this section we 
will explore some of these relationships. They are of great importance in 
helping to spot which kind of generating function is appropriate for which 
kind of recurrence relation or other combinatorial situation. 

Definition. The symbol f fff {°n}o° means that the series f is the ordi- 
nary power series (‘ops’) generating function for the sequence {o n }o°. That 
is, it means that f = a n x n . 

Suppose / TT {°n}o°- Then what generates {o n _|_i}o°? To answer 
that we do a little calculation: 

y: a n+i xn = - yy 

n> 0 m> 1 

Therefore 

/ KKT => ((/ - ao)/x) °jz K+i}o°°- (2-2.1) 

Thus a shift of the subscript by 1 unit changes the series represented 
to the difference quotient (/ — ao)/x. If we shift by 2 units, of course, we 
just iterate the difference quotient operation, and find that 


= (fix) - /( 0 )) 


{On+2}f 


((/ - a 0 )/x) - oq 


X 


f do d\x 
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Note how this point of view allows us to see ‘at a glance’ that the 
Fibonacci recurrence relation F n+ 2 = F n + \ + F n ( n > 0; F 0 = 0; F\ = 
1) translates directly into the ordinary power series generating function 
relation 

= i + 

Indeed, the purpose of this section is to develop this facility for passing 
from sequence relations to series relations quickly and conveniently. 

Rule 1. If f ops {a n }o°, then, for integer h > 0. 

oo ops f ~ a o ah-ix ^ 1 

o 4 — * 



Next let’s look into the effect of multiplying the sequence by powers of 
n. Again, suppose that / FT { a n}o°- Then what generates the sequence 
{na n } q°? The question means this: can we express the series J2 n na n x n 
in some simple way in terms of the series / = a n x n 7 The answer is 
easy, because the former series is exactly xf . Therefore, to multiply the 
nth member of a sequence by n causes its ops generating function to be 
‘multiplied’ by x(d/dx), which we will write as xD. In symbols: 

/ ™ {on}“ => (xDf) FT {na n }^. (2.2.2) 

As an example, consider the recurrence 


(n + l)o n+ i = 3o n + 1 (n > 0; a 0 = 1). 


If / is the opsgf of the sequence {a n } q°, then from Rule 1 and (2.2.2), 


/' 


3 / + 


1 

1 — x ’ 


which is a first order differential equation in the unknown generating func- 
tion, and it can be solved by standard methods. 

Next suppose / FT {°n}o°- Then what generates the sequence 

{n 2 a n }g°? 


Obviously we re-apply the multiply-by-n operator xD. so the answer is 
( xD) 2 f . In general, 

(■ xD) k f °JZ {n k a n } n >o- 

OK, what generates {(3 — 7n 2 )a n } n >o? Again obviously, we do the 
same thing to xD that is done to n, i.e., (3 — 7 (xD) 2 )f is the answer. The 
general prescription is: 
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Rule 2. If f fff { (l n } o° ,■ and P is a polynomial, then 

P(xD)f {P(n)a n } n > 0 . 


Example 1. 

Find a closed formula for the sum of the series ^2 n>0 {n 2 + In + 5)/n!. 

According to the rule, the answer is the value at x =1 of the series 

{(xD) 2 + A{xD ) + 5}e x = {x 2 + x}e x + Axe x + 5e x 

= (x 2 + 5x + 5)e x . 

Therefore the answer to the question is lie. 

But we cheated. Did you catch the illegal move? We took our gen- 
erating function and evaluated it at x = 1, didn’t we? Such an operation 
doesn’t exist in the ring of formal series. There, series don’t have ‘values’ 
at particular values of x. The letter x is purely a formal symbol whose 
powers mark the clothespins on the line. 

What can be evaluated at a particular numerical value of ir is a power 
series that converges at that x, which is an analytic idea rather than a formal 
one. The way we make peace with our consciences in such situations, which 
occur frequently, is this: if, after writing out the recurrence relation and 
solving it by means of a formal power series generating function, we find 
that the series so obtained converges to an analytic function inside a certain 
disk in the complex plane, then the whole derivation that we did formally 
is actually valid analytically for all complex x in that disk. Therefore we 
can shift gears and regard the series as a convergent analytic creature if it 
pleases us to do so. | 


Example 2. 

Find a closed formula for the sum of the squares of the first N positive 
integers. 

To do that, begin with the fact that 


N 


n= 0 


X N+1 _ i 
X — 1 


and notice that if we apply (xD) 2 to both sides of this relation and then 
set x = 1, the left side will be the sum of squares that we seek, and the 
right side will be the answer! Hence 


N 


]Tn 2 


{xD) 2 


x N+ 1 - 1 1 

x — 1 J 


X=1 
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After doing the two differentiations and lots of algebra, the answer emerges 
as 

XV ^+l^ + D (N — 1,2,...), 

n = 1 

which you no doubt knew already. Do notice, however, that the generating 
function machine is capable of doing, quite mechanically, many formidable- 
looking problems involving sums. | 

Our third rule will be a restatement of the way that two opsgf’s are 
multiplied. 

Rule 3. If f ° ps , {a n }o° and g ZZ {6 n }o°> then 


fa °JZ 



(2.2.3) 


Now consider the product of more than two series. For instance, in the 
case of three series, if /, g, h are the series, and if they generate sequences 
a, b and c, respectively, then a brief computation shows that fgh generates 
the sequence 

| ^2 °Actj • (2.2.4) 

^r+s-|-i=n ' n=0 

A comparison with Rule 3 above will suggest the general formulas t-hat- 
apply to products of any number of power series. One case of this is worth 
writing down, namely the expressions for the kth power of a series. 

Rule 4. Let f ZZ {a n }o°, and let k be a positive integer. Then 


f k 


ops 



(2.2.5) 


Example 3. 


Let /(n, k ) denote the number of ways that the nonnegative integer n 
can be written as an ordered sum of k nonnegative integers. Find /(n, k). 
For instance, /( 4,2) = 5 because 4=4-t-0=3d-l=2-h2=l-h3=0d-4. 

To find /, consider the power series 1/(1 — x) k . Sincel/(1— x) ZZ { 1 } , 
by (2.2.5) we have 

1/(1 — ^) fe ^ {f(.n,k)}™ =0 . 


(1.5.5) 


f(n, k) 


( n+k n 1 ) » an d we are finished. || 


Next consider the effect of multiplying a power series by 1/(1 — x). 
Suppose / TZ { a ™}o°- Then what sequence does /(x)/( 1 — x ) generate? 
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To find out, we have 

7- j- — — r = (do T T cl 2 x 2 + • • -)(1 + x + a; 2 + • • •) 

(1-x) 

= a 0 + (a 0 + ai)x + (a 0 + aq + a 2 )x 2 

+ (ao + d\ + a 2 + ct3)x 3 + • • • 

which clearly leads us to: 

Rule 5. If f ops {a n }o° then 

7-h— - = Jy>,} . 

( l ~ X ) Jn>0 

That is, the effect of dividing an opsgf by ( 1- x ) is to replace the sequence 
that is generated by the sequence of its partial sums. 


Example 4. 

Here is another derivation of the formula for the sum of the squares of 


the first n whole numbers. Since 1/(1 — x) yyy { l} n >o> we have by Rule 2, 
(xD) 2 ( 1/(1 — x)) yy {n 2 } n > 0, and by Rule 5 , 


1 — x 


(xD) : 


1 


1 — x 


n s 

3 = 0 J ? 


>0 


That is, the sum of the squares of the first n positive integers is the coeffi- 
cient of x n in the series 


1 


l — x 


{xDf 


1 — x 


x(l + x) 
(1 — x) 4 ' 



\x 


(1 -xY 


(T) 


xllh-x) (n + 2\ fn+l\ 

(T^F = V 3 J + ( 3 ) 

n(n + l)(2n + 1) 

= 6 ’ 


so this must be the sum of the squares of the first n positive integers 
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Example 5. 

The harmonic numbers {H n } ° 


H n 


1 + H 


00 are defined by 

1 


+ ••• + 


n 


(n > 1). 


How can we find their ops generating function? By 
is 1/(1 — x) times the opsgf of the sequence {1 /n}° 


Rule 5 


that function 


of reciprocals of the 
positive integers. So what is / = J/ n>1 x n /n? Well, its derivative is 1/(1 — 
x), so it must be — log (1 — x). That means that the opsgf of the harmonic 
numbers is 


E 

n= 1 


H n x n = 


1 


1 


x 


log 


1 


1 


X 


Example 6. 

Prove that the Fibonacci numbers satisfy 
F 0 + Fi + F 2 + ■ ■ ■ + F n = F n+2 - 


1 


(n > 0). 


By 


Rule 5 


the opsgf of the sequence on the left side is F/(l — x), where F 


is the opsgf of the Fibonac ci numbers, which we found in section 1.3 to be 


x/{l 

is 


x 


■x 2 ). By 


Rule 1 


the opsgf of the sequence on the right hand side 
F — x 1 


x 2 1 — x ’ 

and it is the work of just a moment to check that these are equal. 


Example 7. 

By a fountain of coins we mean an arrangement of n coins in rows such 
that the coins in the first row form a single contiguous block, and that in 
all higher rows each coin touches exactly two coins from the row beneath 
it. If the first row contains k coins, we will speak of an (n, /./-fountain. In 
Fig. 2.1 we show a (28, 12) fountain. 

Among all possible fountains we distinguish a special type: those in 
which every row consists of just a single contiguous block of coins. Let’s 
call these block fountains. 



Fig. 2.1: A (28, 12) fountain 
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The question here is this: how many block fountains have a first row 
that consists of exactly k coins? 

Let f(k ) be that number, for k — 0, 1, 2, . . . If we strip off the first row 
from such a block fountain, then we are looking at another block fountain 
that has k fewer coins in it. Conversely, if we wish to form all possible block 
fountains whose first row has k coins, then begin by laying down that row. 
Then choose a number j, 0 < j < k — 1. Above the row of k coins we will 
place a block fountain whose first row has j coins. If j = 0 there is just 
one way to do that. Otherwise there are k — j ways to do it, depending on 
how far in we indent the row of j over the row of k coins. It follows that 
/( 0) = 1 and 


/(*;) = J>-j)/(j) + 1 (fc = 1,2,...). 


( 2 . 2 . 6 ) 


j = i 


Define the opsgf F(x) = ]bb>o f(j)x J . The appearance, under the 
summation sign in (2.2.6), of a functio n of k — j times a function of j should 
trigger a reflex reaction that Rule 3, above, applies, and that the product 
of two ordinary power series generating functions is involved. The two 
series in question are the opsgf’s of the integers {j } and of the unknowns 
{fU)} i°, respectively. 

However the former opsgf is x / ( 1 — x) 2 . and the latter is F(x) — 1. 
Hence, after multiplying equation (2.2.6) by x k and summing over k > 1 
we obtain 

F(x) - 1= X (F(x)~ 1) + 


(i — x y 


1 — x ’ 


and therefore 


F(x) 


l-2x 


1 — 3x + x' 


(2.2.7) 


The sequence {/(&)} o° begins with 1, 1, 2, 5, 13, 34, 8 9, . . . If thes e num- 
bers look suspiciously like Fibonacci numbers, then see exercise 19. | 


2.3 The calculus of formal exponential generating functions 

In this section we will investigate the analogues of the rules in the 
preceding section, which applied to ordinary power series, in the case of 
exponential generating functions. 

Definition. The symbol f {a n }o° means that the series f is the ex- 
ponential generating function of the sequence {a n }o°, i.e., that 


/ = £ 


a 


71 


n> 0 


n\ 


X 
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Let’s ask the same questions as in the previous section. Suppose 
/ tfl {a n }o°- Then what is the egf of the sequence {a n+ i}o°? We claim 
that the answer is f, because 


OO j-. — 1 

E na n x 

n\ 

71=1 

OO j-, — 1 

E a n x 

(n- 1)! 

71=1 

OO Yf 

E Cln+lX 
n\ 


which is exactly equivalent to the assertion that /' {o n _|_i}o°. 

Hence the situation with exponential generating functions is just a trifle 
simpler, in this respect, than the corresponding situation for ordinary power 
series. Displacement of the subscript by 1 unit in a sequence is equivalent 
to action of the operator D on the generating function, as opposed to the 
operator (/(x) — /( 0))/x, in the case of opsgf’s. Therefore we have, by 
induction: 

Rule V . If f {on}o° then, for integer h > 0, 


{«„+/,}“ ‘Ji o h f- 


(2.3.1) 


The reader is invited to compare this Rule V with Rule 1, stated above. 

Example 1. 

To get a hint of the strength of this point of view in problem solving, 
let’s find the egf of the Fibonacci numbers. Now, with just a glance at the 
recurrence 

F n+2 = F n - i-i + F n (n > 0) 


we see from Rule 1 1 that the egf satisfies the differential equation 


/" = /' + /• 


At the corresponding stage in the solution for the ops version of this prob- 
lem, we had an equation to solve for / that did not involve any derivatives. 
We solved it and then had to deal with a partial fraction expansion in or- 
der to find an exact formula for the Fibonacci numbers. In this version, we 
solve the differential equation, getting 

f(x) = c\e r+x + c 2 e r - x (r± = (1 ± Vh)/2) 

where C\ and c 2 are to be determined by the initial conditions (which 
haven’t been used yet!) /( 0) = 0; /'( 0) = 1. After applying these two 
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conditions, we find that C\ — 1/yfh and C 2 = — 1/\/5, from which the egf of 
the Fibonacci sequence is 


/ = (e r+x - e r ~ x ) /VE. 


(2.3.2) 


Now it’s easier to get the exact formula, because no partial fraction expan- 
sion is necessar y. Just a pply the operator [x n /n!] to both sides of (2.3.2) 
and the formula 


(1.3.3) 


To compare, then, 


materializes. 

"(he ops method in this case involves an easier func- 
tional equation to solve for the generating function: it’s algebraic instead 
of differential. The egf method involves an easier trip from there to the 
exact formula, because the partial fraction expansion is unnecessary. Both 
methods work, which is, after all, the primary desideratum. | 

To continue, we discuss next the analogue of | Rule 2 |for egf’s, and that 
one is easy: it’s the same. That is, multiplication of the members of a 
sequence by a polynomial in n is equivalent to acting on the egf with the 
same polynomial in the operator xD, and we have: 

Rule 2'. If f {°n}o°) ail( l P JS a given polynomial, then 

P(xD)f ‘Jl {P(n)a n } n > 0 . 


Next let’s think about the analogue of jRule 3,| i.e., about what happens 
to sequences when their egf’s are multiplied together. Precisely, suppose 
/ { a n}o° an d g { bn } o° • The question is, of what sequence is fg the 
egf? 

This turns out to have a pretty, and uncommonly useful, answer. To 
find it, we carry out the multiplication fg and try to identify the coefficient 
of x n jn\. We obtain 


/» = iE 


a r x 

r 


r=0 


I 


j^b s x s 


s = 0 


SI 


Urbs ^ r +8 


r,s> 0 


r!s! 


■x 


E**{ E ^f} 

n > 0 ^ r+s=n ' 


The coefficient of x n /n\ is evidently 

(fd) = 


x 


n\ 


r+s=n 


n\a r b a 

r!s! 


E 


Q, r br), — 7 


We state this result as: 
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Rule 3'. Iff eg/ {o n }o° and g {frn}o°j then fd generates the sequence 



(2.3.3) 


This rule should be contrasted with 
multiplication of opsgf’s, the result of w’ 


Rule 3, | the corresponding rule for 
rich is to generate the sequence 



(2.3.4) 


We remarked earlier that the convolution of sequences that is shown in 

(2.3.4) is useful in counting problems where structures of size n are ob- 
tained by stitching together structures of sizes r and n — r in all possible 
ways. Correspondingly, the convolution (2.3.3) is useful in combinatorial 
situations where we not only stitch together two such structures, but we 
also relabel the structures. For then, roughly speaking, there are ("') ways 
to choose the new labels of the elements of the structure of size r, as well 
as a r ways to choose that structure and b n - T ways to choose the other one. 

Since this no doubt all seems to be very abstract, let’s try to make it 
concrete with a few examples. 


Example 2. 
In 


(1.6.13) we found the recurrence formula for the Bell numbers, which 


we may write in the form 


b(n + 1) = (? )K k ) (n > 0; 6(0) = 1). 


(2.3.5) 


We will now apply the methods of this sectio n to find the eg f of the Bell 
numbers. This will give an independent proof of Theorem 1.6.1, since (2.3.5) 


can be derived directly, as described in exercise 7 of chapter 1. 

Let B be the required egf. The egf of the left side of (2.3.5) is, by Rule 
T, B' . If we compare the right side of (2.3.5) with (2.3.3) we see that the 
egf of the sequence on the right of (2.3.5) is the product of B and the egf of 
the sequence whose entries are all l’s. This latter egf is evidently e x , and 
so we have 

B' = e x B 

as the equation that we must solve in order to find the unknown egf. But 
obviously the solution is B = cex p(e K ), and since B(0) = 1, we must 


have c = e 


-l 


Theorem 1.6.1 


fonr which B(x) = exp (e x — 1), completing the re-proof of 
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Example 3. 

In order to highlight the strengths of ordinary vs. exponential gen- 
erating functions, let’s do a problem where the form of the convolution of 
sequences that occurs suggests the ops form of generating function. We will 
count the ways of arranging n pairs of parentheses, each pair consisting of a 
left and a right parenthesis, into a legal string. A legal string of parentheses 
is one with the property that, as we scan the string from left to right we 
never will have seen more right parentheses than left. 

There are exactly 5 legal strings of 3 pairs of parentheses, namely: 

((())); ( 00 ); ( 0 ) 0 ; 000 ; ()(())• (2.3.6) 

Let- /(n) be the number of legal strings of n pairs of parentheses (/( 0) = 1), 
for n > 0. 

With each legal string we associate a unique nonnegat.ive integer k. as 
follows: as we scan the string from left to right, certainly after we have 
seen all n pairs of parentheses, the number of lefts will equal the number of 
rights. However, these two numbers may be equal even earlier than that. 
In the last string in (2.3.6), for instance, after just k = 1 pairs have been 
scanned, we find that all parentheses that have been opened have also been 
closed. In general, for any legal string, the integer k that we associate with 
it is the smallest positive integer such that the first 2k characters of the 
string do themselves form a legal string. The values of k that are associated 
with each of the strings in (2.3.6) are 3, 3, 2, 1, 1. We will say that a legal 
string of 2 n parentheses is primitive if it has k = n. The first two strings 
in (2.3.6) are primitive. 

How many legal strings of 2 n parentheses will have a given value of A:? 
Let w be such a string. The first- 2k characters of w are a primitive string, 
and the last 2 n — 2k characters of w are an arbitrary legal string. There 
are exactly /(n — k ) ways t-o choose the last- 2 n — 2k characters, but in how 
many ways can we choose the first 2 A’? That- is, how many primitive strings 
of length 2k are there? 

Lemma 2.3.1. If k > 1 and g(k ) is the number of primitive legal strings, 
and f(k ) is the number of all legal strings of 2k parentheses, then 

9 (k) = f(k — 1). 

Proof. Given any legal string of A: — 1 pairs of parentheses, make a primitive 
one of length 2k by adding an initial left parenthesis and a terminal right- 
parenthesis t-o it. Conversely, given a primitive string of length 2k, if its 
initial left- and terminal right parentheses are deleted, what- remains is an 
arbitrary legal string of length 2k — 2. Hence there are as many primitive 
strings of length 2k as there are all legal strings of length 2k — 2, i.e., there 
are f(k — 1) of them. |j 
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Hence the number of legal strings of length 2 n that have a given value 
of k is f(k — 1 )/(n — k ). Since every legal string has a unique value of k, it 
must be that 

f{n) = ^f{k -l)f{n-k) (n ± 0; /(0) = 1) (2.3.7) 

k 

with the convention that / = 0 at all negative arguments. 

The recurrence easily allows us to compute the values 1, 1, 2, 5, 14, . . . 
Now let’s find a generating function for these numbers. The clue as to 
which kind of generating function is appropriate comes from the form of 
the recurrence (2.3.7). The sum on the right is obviously related to the 
coefficients of the product of two ordinary power series generating functions, 
so that is the species that we will use. 

Let F = f(k)x k be the opsgf of {/(n)} n > 0 . Then the right side of 
(2.3.7) is almost the coefficient of x n in the series F 2 . What is it exactly ? 
It is the coefficient of x n in the product of the series F and the series 
f(k — l)x k . How is this latter series related to FI It is just xF. 
Therefore, if we multiply the right side of (2.3.7) by x n and sum over n^O, 
we get xF 2 . If we multiply the left side by x n and sum over n ^ 0, we get 
F — 1. Therefore our unknown generating function satisfies the equation 

F(x) — 1 = xF(x) 2 . (2.3.8) 


Here we have a new wrinkle. We are accustomed to going from recur- 
rence relations on a sequence to functional equations that have to be solved 
for generating functions. In previous examples, those functional equations 
have either been simple linear equations or differential equations. In (2.3.8) 
we have a generating function that satisfies a quadratic equation. When 
we solve it, we get 

1 ± \f\ — Ax 
2x 



Which sign do we want? If we choose the ‘+’ then the numerator 
will approach 2 as x — > 0, so the ratio will become infinite at 0. But our 
generating function takes the value 1 at 0, so that can’t be right. If we 
choose the ‘ — ’ sign, then a dose of L’Hospital’s rule shows that we will 
indeed have T(0) = 1. Hence our generating function is 


F(x) 


1 — y/l — Ax 
2x 


(2.3.9) 


This is surely one of the most celebrated generating functions in com- 


t-here is an explicit formula for them. For the moment, we declare that 
this exercise, which was intended to show how the form of a recurrence can 
guide the choice of generating function, is over. | 


binatorics. The numbers f{n) are the Catalan numbers, and in (2.5.10) 
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Example 4. 

By a derangement of n letters we mean a permutation of them that has 
no fixed points. Let D n denote the number of derangements of n letters, 
and let D{x ) {D n } q°. We will find a recurrence for the sequence, then 

L)(x). then an explicit formula for the members of the sequence. 

The number of permutations of n letters that have a particular set of 
k < n letters as their set of fixed points is clearly -D n _fc. There are ()';) 
ways to choose the set of k fixed points, and so there are exactly Q' j D n -}~ 
permutations of n letters that have exactly k fixed points. Since every 
permutation has some set of fixed points, it must be that 


n\ — 


( u )D n -k {n > 0 ). 


If we take the egf of both sides we get, by 

1 


Rule 3' 


1 — x 


= e x D{x) 


(see how easy that was?), from which D{x) = e x /{l — x ). Next, by Rule 
j~5j if we take [x n ] on both sides, we find that 


Dr 

n\ 


= 1-1 + 


1 1 
2! ~ 3! 


+ -” + (-l) f 


n\ 


and we are finished. ■ 

Just as in the case of ordinary power series generating functions, pleas- 
ant and useful things happen when we consider products of more than two 
exponential generating functions. For instance, if we multiply three of them, 
/, g, and h, which generate a, b, and c, respectively, then we find that 


fgh 


egf 


E 


n\ 


„ r+s+t=n 


r!s!t! 


a r b s ct 


(2.3.10) 


n= 0 


and therefore such operations can be expected to be helpful in dealing with 
sums that involve multinomial coefficients. 

If f eg/ {a n }8° then 


r 


egf 


E 


n\ 


. rH h rk=n 


a ri a r2 

ri\r 2 \ ■ ■ -r k \ 




n = 0 


(2.3.11) 
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2.4 Power series, analytic theory 

The formal theory of power series shows us that we can manipulate 
recurrences and solve functional equations, such as differential equations, 
for power series without necessarily worrying about whether the resulting 
series converge. If they do converge though, and they represent functions, 
that’s a big advantage, for then we may be in a position to find analytic 
information about the recurrence relation that might not otherwise be easily 
obtainable. 

In this section we will review the basic analytic properties of power 
series and their coefficient sequences. 

First, suppose we are given a power series 

/ = J ^a n z n , 

n> 0 


where we now use the letter z to encourage thinking about complex vari- 
ables. Question: for exactly what set of complex values of z does the series 
/ converge? We want to give a fairly complete answer to this question, and 
express it in terms of the coefficient sequence {a n }o°. 

Theorem 2.4.1. There exists a number R, 0 < R < Too, called the radius 
of convergence of the series f, such that the series converges for all values 
of z with \z\ < R and diverges for all z such that \z\ > R. The number R 
is expressed in terms of the sequence {a n }o° of coefficients of the series by 
means of 


R 


1 

lim sup n _ >00 | On | !/ n 


(l/0 = oo;l/oo = 0). 


(2.4.1) 


Before proving the theorem, we recall the definition of the limit superior 
of a sequence. Let {iCn}o° be a sequence of real numbers, and let L be a 
real number (possibly = ±oo). 

Definition. We say that L is the limit superior (‘upper limit’) of the 
sequence {x n } if 

(a) L is finite and 

(i) for every e > 0 all but finitely many members of the 
sequence satisfy x n < L + e, and 

(ii) for every e > 0, infinitely many members of the sequence 
satisfy x n > L — e, or 

(b) L = Too and for every M > 0, there is an n such that x n > M , 
or 

(c) L = — oo and for every x, there are only finitely many n such that 
x n > x. 

If L is the limit superior of the sequence {s n }g°, then we write L = 
limsupn^oolxn}, or perhaps just L = limsup{a: n }, if the context is clear 
enough. 
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The limit superior has the following properties: 

• Every sequence of real numbers has one and only one limit superior 
in the extended real number system (i.e. , including ±oo). 

• If a sequence has a limit L, then L is also the limit superior of the 
sequence. 

• If S is the set of cluster points of the sequence {x n }o°, then 
limsup{x n } is the least upper bound of the numbers in S. 


Proof of theorem 2.4. lJ Let R be the number shown in 
suppose first that 0 < R < oo. Choose z such that \z\ < R. 
that the series converges at z. 

For the given z, we can find e > 0 such that 


(2.4.1) 
We will 


and 

show 


z < 


R 

1 + eR 


Now, by the definition of the limsup, there exists N such that for all n > N 
we have 

\a n \ 1/n < ]- + e. 

IL 

Hence, for these same n, 


\a r 


z ' < 


M( + e ) 


n 


Let a denote the number in the curly brace. Then by our choice 
of e, we have a < 1. Hence the series a n z n converges absolutely, by 
comparison with the terms of a convergent geometric series. Therefore our 
series converges absolutely at z, and hence it does so for all \z\ < R. 

Next we claim the series diverges if \z\ > R. Indeed, we will show 
that for such z, the sequence of terms of the series does not approach zero. 
Since \z\ > R : we can choose e > 0 such that if 9 = \ (z/R) — ez |, then 
9 > 1. By definition of the limsup, for infinitely many values of n we have 
\a n \ li/n > (1 / R) — e. Hence, for those values of n, 


a n z n | > 



= e n 


which increases without bound since 9 > 1. Hence, that subsequence of 
terms of the power series does not approach zero and the series diverges. 
This completes the proof of the theorem in the case that 0 < R < oo. The 
cases where R — 0 or R = Too are similar, and are left to the reader. §j 

Theorem 2.4.2. Suppose the power series Y2 a nZ n converges for all z in 
\z\ < R, and let f(z) denote its sum. Then f(z ) is an analytic function in 
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\z\ < R. If furthermore the series diverges for \z\ > R, then the function 
f(z) must have at least one singularity on the circle of convergence \z\ = R. 

In other words: a power series keeps on converging until something 
stops it, namely a singularity of the function that is being represented. 
Proof. If / has no singularity on its circle of convergence \z\ = R, then 
about each point of that circle we can draw an open disk in which / remains 
analytic. By the Heine-Borel theorem, a finite number of these disks cover 
the circle \z\ = R. and therefore / must remain analytic in some larger disk 
\z\ < R + e. By Cauchy’s inequality, the Taylor coefficients of the series for 
/ satisfy \a n \ < AI/(R + e) n , for all n, and so the series must converge in a 
larger disk, a contradiction. | 

Example 1. 

The series 'Yhz n converges if \z\ < 1 and diverges if \z\ > 1. Hence the 
function that is represented must have a singularity somewhere on the circle 
\z\ = 1. That function is 1/(1 — z), and sure enough it has a singularity at 
z = 1. ■ 

Example 2. 

Take the function f(z) = 1/(2 — e z ). Suppose we expand f(z) in a 
power series about z = 0. What will be the radius of convergence of the 
series? 

According to the theorem, the series will converge in the largest disk 
\z\ < R in which / is analytic. The function fails to be analytic only at- 
tire points z where e z = 2. Those points are of the form z = log 2 + 2hiri, 
for integer k, and the nearest one to the origin is log 2. Therefore f(z) is 
analytic in the disk \z\ < log 2 and in no larger disk. Hence the radius of 
convergence of the series will be R = log 2. 9j 

Remember that, if f(z) is given, the best way to find the radius of 
convergence of its power series expansion about the origin may well be to 
look for its singularity that is nearest to the origin. 

Example 3. 

Take the function f(z) = z/{e z — 1) (/( 0) = 1). Estimate the size of the 
coefficients of its power series about the origin directly from the analyticity 
properties of the function. 

This is where things start getting more interesting. This f(z') is ana- 
lytic except possibly at points z where e z = 1, i.e., except possibly at the 
points z = 2k 7n for integer k. The nearest of these to the origin is the origin 
itself (k = 0). However, / is not singular at z = 0 because even though the 
denominator of / is 0 there, the numerator is also, and L’Hospit-al’s rule, or 
whatever, reveals that the value /( 0) = 1 removes the singularity. Hence 
the singularity of this function that is nearest to the origin is at z = 2 tt i. 
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The power series 


t = £ 


a n z 


n = 0 


therefore has radius of convergence R = 27t. 

The problem asks for estimates of the sizes of the c oefficients |a n l n°- 


But since the radius of convergence is 27 r, we have, from theorem 2.4.1 


limsup |a n | 1/n = 

Z7T 

It follows that, first of all, for all sufficiently large values of n we have 

l a n| J/n < 77 - + e, 

and for infinitely many values of n we have 

M 1/n > — - e. 

1 1 2t r 

Therefore, what we find out about the coefficients is that for each e > 0, 
there exists N such that 


(n > N ) 


°„i < |T + £ 


and further, for inhnitely many values of n. 

Ianl > 1 2^“ 


Therefore the coefficients of this series decrease to zero exponentially fast, 
at roughly the rate of l/(27r) n , for large n. This is quite a lot to have 
found out about the sizes of the coefficients without having calculated any 
of them! ■ 

We state, for future reference, a general proposition that summarizes 
what we learned in this example. 

Theorem 2.4.3. Let f(z) = a n,z n ' be analytic in some region containing 
the origin, let a singularity of f(z ) of smallest modulus be at a point zq ^ 0, 
and let e > 0 be given. Then there exists N such that for all n > N we 
have 

l N n 

|Onl < I 7 7 + e 


Further, for infinitely many n we have 

1 




— e 


\zo\ 
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In 


chapter 5 we will learn how to make much more precise estimates of 


the sizes of the coefficients of power series based on the analyticity, or lack 
thereof, of the funct ion that is repres ented by the series. For instance, the 
method of Darboux Theorem 5.3.1) is a powerful technique for asymptotic 


analysis of coefficient sequences of generating functions. The existence of 
such methods is an excellent reason why we should be knowledgeable about 
the analytic, as well as the formal, side of the subject of generating func- 
tions. 

Another path to the asymptotic analysis of coefficient sequences flows 
from Cauchy’s formula 


1 f f{z)dz 

2m J z n+1 


{n = 0,1,2,...) 


(2.4.2) 


that expresses the nth coefficient of the Taylor’s series expansion f(z ) = 
^ ~2a n z n as a contour integral involving the function /. The contour can 
be any simple, closed curve that encloses the origin and that lies entirely 
inside a region in which / is analytic. 

One has immediately, from (2.4.2), Cauchy’s inequality, which states 


that 


On 


< 


M(r) 

y*n 


and which holds for all n > 0 and all 0 < r < R, where R is the radius of 
convergence of the series, and 


M{r) = max | f(z ) | = max|/(z)|. 

\z \<r \z\—r 


(2.4.3) 


Just as the analysis of Example 3 above is refined by the method of Dar- 


boux to a much more precise method of estimating the growth of coefficient 
sequ ences, s o is Cauchy’s inequality refined by the method of Hayman (The- 
orem 5.4 . 1 to another very precise tool for the same purpose. 

If a power series actually converges to a function, then we can use roots 
of unity to pick out a progression of terms from a series. For instance, how 
can we select just the even powers out of a power series? If the series 
represents a function /, then, as is well known, ( f(x ) + /(— x))/2 has just 
the terms that involve even powers of x from the series for f(x), and — 
/(— x))/2 has just the odd ones. 

But suppose, instead of wanting to keep every second term of the series, 
we want to keep only every third term? For instance, what function do we 
get if we take the exponential series and keep just the terms where the 
powers of x are multiples of 3? In other words, who is 


9 ( x ) = 


x 


3 n 


i>0 


(3n)! 


(2.4.4) 


2.4 Power series, analytic theory 


51 


Well, what makes the ( f(x ) + f(—x))/2 thing work is that the two 
square roots of unity, namely ±1, have the property that 


l n + (-l) r 


1, if n is even; 
0, if n is odd. 


Now here is a correspondingly helpful property of the three cube roots 
of unity 1, uj\, 0 J 2 : 


(1 + qff) = f 1, if 3\n; 

3 1 0, else. 


(2.4.5) 


Since that is the case, we have, for any convergent power series / = 
^ ^ d r x r ? 

f(,)UMUM = y %/ . (2.4.6) 

3 

r 

Since uj\ = and 002 = e^ 471 " 1 ^ 3 , we can unmask the mystery 


function g(x) in (2.4.4) 


as 


g{x) = - (e* + e UlX + 

= ^ (e x + 2e~ x / 2 cos (-^—) 


(2.4.7) 


Example 4. 

For hxed n , hnd 


A„ = £(-l)' 


n 
3 k 


We could do this one if we knew the function 


= £ ( f) x3k ’ 


3k 

because A n = /(— 1). But f(x) picks out every third term from the series 
F(x) = (1 + x ) n , and so 

f(x) = (F(x) + F(uo x x) + F(lu 2 x ))/ 3 

= {(1 + x) n + (1 + ui lX ) n + (1 + iv 2 x) n } /3. 

Thus the numbers that we are asked to hnd are, for n > 0, 

A n = /(-l) = {(1 - ui) n + (1 - U2 ) n }/3 

1 |Y3-\/3 i\ n ( 3 + V3i 

-{(— ) +(- 

= 2 ■ 3( n / 2-1 ) cos ( — ). 

6 


(2.4.8) 
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The first few values of the {A n } n >o are 1, 1, 1, 0, —3, —9, —18 


To complete this example we want to prove the helpful property (2.4.5) 
of the cube roots of unity. But for every r > 1, the rth roots of unity do 
the same sort of thing, namely 

(2.4,9) 


UJ r = l 


Indeed, the left side is 


1 r_1 

?£■ 

3=0 


D {2irijn)/r 


which is a hnite geometric series whose sum is easy to find, and is as stated 
in (2.4.9). So, with more or less difficulty, it is always possible to select a 
subset of the terms of a co nvergent ser ies in which the exponents form an 
arithmetic progression. See exercise 25 


2.5 Some useful power series 

Generatingfunctionologists need reference lists of known power series 
and other series that occur frequently in applications of the theory. Here 
is such a list. For each series we show the series and its sum. The radius 
of the largest open disk, centered at the origin, in which convergence takes 
place will be, of course, the modulus of the singularity of the function that 
is nearest to the origin. Considering the relatively simple forms of the 
functions, the locations of those singularities will be sufficiently obvious 
that the radii of convergence are not explicitly shown in the table below. 


1 


1 — x 


n> 0 


log 


1 


1 — X 


£ 

n> 1 


X 


n 


n 


e x = 


£ 

n> 0 


n\ 


sinx 


£(-D 

n>0 


sg2,Tl-\-l 

(2nd- 1)! 


cosx 


£(-i> 

n> 0 


x^ n 
(2 n)\ 


(2.5.1) 

(2.5.2) 

(2.5.3) 

(2.5.4) 

(2.5.5) 
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(i + *)“ = £ 


a 


x 


1 


(1 — x) k+1 


E 


n + k 


n 


x 


x 


i = E 


B n x n 


e* — i * — ' n\ 

n> 0 


2n+l 


tan 




n>0 


(2.5.6) 

(2.5.7) 

(2.5.8) 

(2.5.9) 


1 

2x 


(1 - Vl -4x) = Y 


n + 1 



= 1 + x + 2a; 2 + 5a; 3 + 14a; 4 + 42a; 5 + 132a; 6 
+ 429a; 7 + 1430a; 8 + 4862a; 9 + • • • 


(2.5.10) 


1 

\/l — 4a; 


E®** (^.n) 

fc v 7 

1 + 2a; + 6a; 2 + 20a; 3 + 70a; 4 + 252a; 5 + 924a; 6 
+ 3432a; 7 + 12870a; 8 + 48620a; 9 + • • • 


a;cotx = ^^- — E*, 2k x 2k (2.5.12) 

fe> o ' 

rp 2 /y>4 O /y> 6 /y> 8 O /-vo 1 0 

it Lj tXj «!/ ^ tl/ 

~~ T ~~ 45 ~~ 945 ~~ 4725 ~~ 93555 


tanx = ^(-l) r ~ l 2 ~' (22 ' ., 1 ^ 2r x 2r ~ 1 (2.5.13) 

r>l ( 2r > 

a; 3 ( 2a; 5 | 17a; 7 | 62a; 9 | 1382a; 11 
_ a; + Y + + 315 + 2835 + 155925 + ’" 

21844 a; 13 | 929569 a; 15 
+ 6081075 + 638512875 + " ’ 
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— = V(-i)- i (4r : 2 * g2r x 2 - 
since / (2 r) 

r>0 V ’ 

x 2 7x 4 31x 6 

= 1 -)- -f- -f- + • • • 

6 360 15120 


(2.5.14) 


1 

y/l — 4x 





(2.5.15) 




E 

n> 0 


fc(2n + fc — 1)! n 

n!(n + fc)! 


(k > 1) 


(2.5.16) 


sin 1 (cc) 


1 x 3 

X+ 2Y + 


l-3x 5 


+ 


1 - 3-5x 7 
2-4-67 


+ ... 


(2.5.17) 


e x sin x 


E 

n> 1 


2 2 sin ^ 

-^-x n 

n\ 


x + x + 


X 


ar 

30 


x 6 

90 


x 

630 


+ 


(2.5.18) 


1 

2 


tan 1 (cc) log (1 + x 2 ) 


r>l 


CC 2r+1 

2r + 1 


x 3 hx b | 7cc 7 761x 9 
7 12" + To" ~~ 2520 + 


(2.5.19) 


1 

4 


tan 1 (cc)log 


1 + x 
1 — x 



(2.5.20) 


1 

2 



1 — cc 


= E 


r 

x 


r 


(2.5.21) 
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_ x 2 ( x 3 llx 4 : 5x 5 : 137x 6 : 7x 7 

~ ~2 + T + 24 + HT + 360 + "20" H 


— vl — x 


(2.5.22) 


^ 16 fc v / 2(2A’)!(2/c + 1)! 1 

1 / x 7x 2 33 a; 3 715a; 4 4199a; 5 

+ 8 + 128 + 1024 + 32768 + 262144 

52003 a; 6 334305 a; 7 17678835 a; 8 

+ 4194304 + 33554432 + 2147483648 
119409675 a; 9 1641030105 a; 10 \ 

+ 17179869184 + 274877906944 H ) 


^ nto(4 f + 1) 


arcsm x 


, , k+1 

h < 2fc+1 > ! 


rft 2 /y> 3 C /yj4 /yi5 1 ^7 /v>6 1 O /y» T 

tXj iXj KJ tXj * Aj JL I 1 U tX/ 

= 1+x + y + T + ir + _ 6 _ + i4r + i26" 

629 a; 8 325 a; 9 8177 a; 10 

+ W + W + -i45l62- + '" (2 ' 5 ' 23) 


2 oo 


A k kl 2 

(k + l)(2fc + l)f 


(2.5.24) 


a; 2 _ 8a; 4 4a; 6 i 128a; 8 i 128a; 10 

_ L + Y + l^ + ^ + 1575 + 2079 + " ' 


00 nk (a A: i -i \fc 

Z ' 12 2 ' 1 -' ^.fc 


+ vl + a; 2 )“ = — 2 2 — a; 

(1 + k/a)k\ 


2 2 
a z or 


_ Uj <Aj I Laj Uj \ Q 

= 1 + ax H b 1 ar + 

2 6 6 7 


(2.5.25) 


-a 2 a 4 \ 4 

x 

6 24 7 


a (9 — 10 a 2 + a 4 ) a; 5 i a 2 (64 — 20 a 2 + a 4 ) a; 6 

H 120 H 720 

a (-225 + 259 a 2 -35 a 4 + a 6 ) x 7 

+ ^40 

a 2 (-2304 + 784 a 2 -56 a 4 + a 6 ) a; 8 

+ + * * * 

40320 
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Iii the above, the {B n } are the Bernoulli numbers, and they are defined 
by (2.5.8). The Bernoulli numbers {B n } q 6 have the values 


. Ill 15 

1 , - 1 / 2 , -, 0 , , 0 , — , 0 , , 0 , — , 0 , 

’ ' ’ 6’ ’ 30’ ’ 42’ ’ 30’ ’ 66’ ’ 


691 7 3617 

2730 ’ ’ 6’ ’ ~~ 510 ' 


The {H n } are the harmonic numbers that were defined in section 2.2 
The symbol m k 


m 


(2.5.22)-(2.5.25) 


(2.5.25) 
are taken 


means m 


from [Ko 


m + !)••• (m+k — 1). The expansions 


2.6 Dirichlet series, formal theory 

We have already discussed two slightly different forms of generating 
functions of sequences, namely the ordinary power series form and the ex- 
ponential generating function form. We remarked that when, in a particular 
problem, one has to decide which of these forms to use, the choice is most 
often dictated by the form of the multiplicative convolution of the two se- 


quences that occurs in the problem. If the form is as in Rule ?>' and (2.3.3) 


then we choose the egf, whereas if it is of the form (2.3.4) the opsgf may 
well be preferred. 

To help highlight the basis for this kind of choice, we will now discuss 
yet another kind of generating function that matches yet another kind of 
convolution of two sequences, a kind that also occurs naturally in many 
problems in combinatorics and number theory. 


Definition. Given a sequence {a n }i°; we say that a formal series 


/( s ) = Z 

n= 1 


On 

n s 


. 02 03 04 

“° 1 + WF + F + 


(2.6.1) 


is the Dirichlet series generating function (Dsgf) of the sequence, and we 
write 

m Kjr- 

The importance of Dirichlet series stems directly from their multipli- 
cation rule. Suppose /(s) fTf {o n }i° and g(s) fTf {6n}i°- The question is, 
what sequence is generated by f(s)g(s)7 

To find out, consider the product of these series, 


f 9 = (oi + 022 s + a^3 s + • • -)(6i + 622 s + 633 s + ■ • •) 
= (fli&i) + (0162 + 0 2 6 i)2 * + (0163 + 0363)3 S 
+ (ai &4 + O262 + 0463)4 A T ■ ■ ■ 
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What- is the general rule? In the product fg, what is the coefficient of 
n~ s l It is the sum of all products of a’s and b's where the product of their 
subscripts is n, i.e. , it is 

a r 6 s . 

rs—n 

Now if rs = n then r and s are divisors of n, so the above sum can also be 
written as _ 

d\n 

in which the symbol L d\n’ is read Vi divides n .’ We state this formally as: 
Rule 1 " . If f(s) fb? {o n }f and g(s) ’dlf {6 n }i°, then 

} oo 

(2.6.2) 

n= 1 


f( s )g{ s ) 



Let’s hasten to say what kind of a problem gives rise to this kind of a 
convolution of sequences. It is, roughly, a situation in which all objects of 
size n are obtained by stitching together d objects of size n/d, where d is 
some divisor of n. Before we get to examples of this sort of thing, since the 
multiplication is so important, let’s look at a few more of its properties. 

What happens to the sequence generated if we take the kth power of 
a Dirichlet series? Let’s work it out, as follows: 



This shows: 

Rule 2 ". If f(s) ffZ { a n}i° then f(s) k fhf a sequence whose nth member 
is the sum, extended over all ordered factorizations of n into k factors, of the 
products of the members of the sequence whose subscripts are the factors 
in that factorization. 

What- series / generates the sequence of all Ls: {1}J°? When we asked 
that- question in the cases of the opsgf and the egf, the answers turned out- 
t-o be ‘famous’ functions. For opsgf’s it- was 1/(1 — x) and for egf’s it was e x . 
In the present- case, the formal Dirichlet series whose coefficients are all l’s 
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is not related to any simple function of analysis, it is a new creature, and 
it gets a new name: the Riemann zeta function. It is the Dirichlet series 


CM = E 

n = 1 


l 

n s 


= l -s + 2~ s + 3~ s + 4 -s + • • • , 


(2.6.3) 


and it is one of the most important functions in analysis. 


Mow, sin ce C(s) Dir ) {l}f°, what sequence does £ 2 (s) generate? Directly 


from 


(2.6.2) 


In 


1C 2 


s = 


5^1 • 1 = d{n ), 

d\n 


where d{n) is the number of divisors of the integer n. The sequence d(n) is 
quite irregular, and begins with 


1 , 2 , 2 , 3 , 2 , 4 , 2 , 4 , 3 , 4 , 2 ,... 


Likewise, £(s) fc generates the number of ordered factorizations of n 
into k factors. If the factor 1 is regarded as inadmissible, then (£(s) — l) k 
generates the number of ordered factorizations of n in which there are k 
factors, all > 2. 

One can go on and study further examples of interesting number- 
theoretic sequences that are generated by relatives of the Riemann zeta 
function, but there is a somewhat breathtaking generalization that takes in 
all of these at a single swoop, so let’s prepare the groundwork for that. 

Definition. A number-theoretic function is a function whose domain is 
the set of positive integers. A number-theoretic function f is said to be 
muitipiicative if it has the property that f(mn ) = f(m)f(n ) for aii pairs of 
relatively prime positive integers m and n. 

Since every positive integer n is uniquely, apart from order, a product 
of powers of distinct primes, 


Nevertheless, its Dirichlet series generating function is £ 2 (s), by 


Rule 2' 


n = pW-p?h (2.6.4) 

it follows that a multiplicative number-theoretic function is completely de- 
termined by its values on all powers of primes. Indeed, 

/(n) = /(Pi 1 )/(pS a )" ■/(??*■)■ (2-6.5) 


For instance, suppose that I have a certain function / in mind. It is 
multiplicative and, further, for every prime p and positive integer m we 
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have f(p m ) = v 2rn . Well t hen, it must be that f(n) = n 2 for all n, because 


if n is as shown in 


(2.6.4) 


then 


/(«)=/(]>?')= nwi 


= n = n 


p. 


CL-i 


= n 2 , 


as claimed. 

Another, less obvious, example of a multiplicative function is d(n), the 
number of divisors of n. For instance, 

6 = d(l2) = d( 3 ■ 4) = d(3)d(4) = 2-3 = 6. 

To see that d(n) is multiplicative in general, let m and n be relatively prime 
positive integers. Then every divisor d of mn is uniquely the product of a 
divisor d! of m and a divisor d" of n. Indeed, we can take d! = gcd(d , m) 
and d" = gcd(d. n ). Therefore the number of divisors of mn is the product 
of the number of divisors of m and the number of divisors of n, which was 
to be shown. 

It is quite easy, therefore, to dream up examples of multiplicative func- 
tions: let your function / do anything it likes on the powers of primes, then 
declare it to be multiplicative, and walk away. 

Multiplicative number-theoretic functions satisfy an amazing identity, 
which we will state, then prove, and then use. 

Theorem 2.6.1. Let f be a multiplicative number-theoretic function. 
Then we have the formal identity 

~T~ = II i 1 + f(p)P~ S + f(p 2 )P~ 2s + f(p 3 )P~ 3s 4 } (2-6.6) 

n= 1 p 

in which the product on the right extends over all prime numbers p. 

Proof. Imagine, if you will, multiplying out the product that appears on 
the right side of (2.6.6). Each factor in that product is an infinite series. 
The product looks like this, when spread out in detail: 

(1 + /( 2)2“ s + /( 2 2 )2 “ 2s + /( 2 3 )2“ 3s + • • -)x 
(1 + /( 3)3“ s + /( 3 2 )3 “ 2s + /( 3 3 )3“ 3s + ■ • -)x 
(1 + /( 5)5“ s + /( 5 2 )5 “ 2s + /( 5 3 )5“ 3s + ■ ■ -)x 
(1 + /( 7)7“ s + /( 7 2 )7 “ 2s + /( 7 3 )7“ 3s + ■ • •) x 


(2.6.7) 
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To multiply out a bunch of formal infinite series like this, we reach into 
the first parenthesis and pull out one term, for instance /( 2 3 )2“ 3s . Then 
we reach into the second parenthesis, pull out one term, say /( 3)3 -s and 
multiply it by the one we got earlier. This gives us an accumulated product 
(so far) of 

/(2 3 )/(3)2 -3s 3 -s = /( ^|, (3) - (2-6.8) 

Suppose, just as an example, that in all of the following parentheses we 
exercise our choice of one term by pulling out the term ‘1.’ Then, as a 
result of having made all of those choices, one out of each parenthesis, the 
contribution to the answer would be the single term shown in (2.6.8) above. 

Now here’s the interesting part. That particular set of choices has 
produced a term that involves (24) _s . What other sequence of choices of 
a single term out of each parenthesis would also lead to a net contribution 
that involves (24) _s ? The answer: no other set of choices can do that. 

Indeed, if from the first parenthesis we choose any term other than 
/( 2 3 )2 _3s , then no matter what terms we pull out of all following paren- 
theses, there is no way we will ever find the power of 2, namely 2 -3s , that 
occurs in (24) -s . We need three 2’s, and no other parenthesis has any 2’s 
at all to offer, so we’d better take them when we have the chance. 

Similarly, we need a factor of 3 -s in order to complete the formation 
of the term (24 ) ~ s . There are no 3’s available in any parenthesis other than 
the second one, and there, to get the right number of 3’s, namely one, we 
had better take the term /( 3)3 -s that we actually chose. 

Thus, the coefficient of (24) -s on the right side of (2.6.6) is just what 
we found, namely /(2 3 )/(3). Now since / is multiplicative, that’s the same 
as /( 24). Hence the coefficient of (24 ) ~ s is /( 24). But that is just what 
the left side of (2.6.6) claims. 

lat agai n, using ‘ n ’ instead of ‘24.’ Let n be some fixed 
be its factorization into prime powers. In order to 
ves n ~ s , i.e., that involves Y\.Pf aiS i we are f° rce d to 


(2.6.4) 


Let’s say t, 
integer, and let 
obtain a term that invo 
choose the ‘1’ term in every parenthesis on the right side of (2.6.7), except 


for those parentheses that involve the primes pi that actually occur in n. 
Inside a parenthesis that belongs to pi, we must choose the one and only 
term in which pi is raised to the power with which it actually occurs in n. 
else we won’t have a chance of getting n~ s . Thus we are forced to choose 
the term /(]:>'■ ' )pf a "' s out of the parenthesis that belongs to p t . That means 
that the coefficient of n~ s in the end will be 


II /W) = /(»), 


by 


(2.6.5) 


Let’s look again at 


( 2 . 6 . 6 ) 


multiplicative function is comp 


One thing that is very apparent is that a 
etely determined by its values on all prime 
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powers. Indeed, on the right side of (2.6.6) 
prime powers, but on the left, all values ap 
Try an example of the t heorem. 
f(n) = 1 (all n). Then (2.6.6) | says that 


we see only the values of / at 
lear. 

Take the multiplicative function 


c( s ) = n i 1 +p s +p 2s h — } 

p 

= n{iv^} (2'69) 

1 

- n P (i-p- s )’ 


which is a fundamental factorization of the zeta function. 

For another example, take the multiplicative function n(n) whose val- 
ues on prime powers are 


+ 1, if a = 0; 

hO“) = { if a = 1; 

0, if a >2. 


With this function substituted for / in (2.6.6) 


formidable series in the braces now has only two terms, and (2.6.6) 


one sees that the once 
reads 


£^ = I Itl-P-}. (2.6.10) 

n> 1 p 


An important fact emerges by comparison of (2.6.9) with (2.6.10): the 
series £(s) and the series on the left side of (2.6.10) are reciprocals of each 
other. Hence, 


1 

Oh 


£ 

n> 1 


/i(n) 




or, what amounts to the same thing, l/£(s) fFf {/i(n)}f°. 

The function /r(n) is the Mobius function , and it plays a central role 
in the analytic theory of numbers, because of the fact that it is generated 
by the reciprocal of the Riemann zeta function. For instance, watch this: 
Suppose we have two sequences {a n }^° and { b n } , and suppose that these 

two sequences are connected by the following equations- 


a n = s ^b d (n > 1). (2.6.11) 

d\n 


The question is, how can we invert these equations, and solve for the &’s in 
terms of the a’s? 
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Nothing to it. Let the Dsgf’s of the two sequences be A(s) a nd B(s). 


Then, if we take a step into Generatingfunctionland, we see that (2.6.11) 
means 

Ms) = B(s)C(s) 


by Rule l" Hence B(s) = A(s)/((s), and then from Rule 1" again, 

a d (n = 1,2,3,...) (2.6.12) 


h n = ^2 V 

d\n 


This is the celebrated Mobius Inversion Formula. The reciprocal relation- 
ships (2.6.11) and (2.6.12) of the sequences mirror the reciprocal relation- 
ships of their Dsgf’s £(s) and l/£(s). 

Example 1. Primitive bit strings. 

How many strings of n 0’s and l’s are primitive , in the sense that such 
a string is not expressible as a concatenation of several identical smaller 
strings? 

For instance, 100100100 is not primitive, but 1101 is. 

There are a total of 2 n strings of length n. Suppose f(n) of these are 
primitive. Every string of length n is uniquely expressible as a concatenation 
of some number, n/d, of identical primitive strings of length d, where d is 
a divisor of n. 

Thus we have 


2 n = Y 1 f(d) (n= 1,2,...) 

d\n 


By (2.6.12) we have 


/(n)=£>(j) 2 J (n = 1,2,...) 


d\n 


(2.6.13) 


for the required number. 

Example 2. Cyclotomic polynomials 

Among the n roots of the equation x n = 1, the primitive nth roots of 
unity are those that are not also mth roots of unity for some m < n. Thus 
the 4th roots of unity are ±1, ±z, but ±1 are roots of x 2 = 1, so they aren’t 
primitive 4th roots. 

In general, the nth roots of unity are {g 27r * r / n }^“J , and the primitive 
ones are 

{ e 27rir/n| (0 < r < n — 1; gcd(r, n) = 1). 

So for each n there are exactly 4>(n) primitive nth roots of unity. 

The equation whose roots are all n of the n roots of unity is obviously 
the equation x n — 1 = 0. The question is this: what is the polynomial 
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$ n (x) of degree (f)(n) whose roots are exactly the set of primitive nth roots 
of unity? In other words, what can be said about the polynomial 

$«(*)= n ( x — e 27rir / n ) (n= 1,2,3,...)? 

0<r<n— 1 
gcd(r,n) = l 

The polynomials < h n (x) are called the cyclotomic (“circle-cutting”) polyno- 
mials. 

The important fact for answering this question is that 

\\^ d {x) = l-x n (n = 1,2,3,...). (2.6.14) 

d\n 


Indeed, the right side of the equation is the product of all possible factors 
(u — x) where u> is an nth root of unity, primitive or not. But every nth 
root of unity is a primitive cith root of unity for exactly one d < n, and 
that d is a divisor of n. 

In detail, if we have some nth root ui = e 27! "* r / n , then let g = gcd(r, n), 
d = n/g, and r' = r/g. Since uo = e 2ntr ! d we see that w is a primitive cith 
root of unity and that d\n. Thus every linear factor on the right side of 
(2.6.14) occurs in one and only one of the cyclotomic polynomials on the 
left side of (2.6.14), which proves the assertion. 

From (2.6.14) we will obtain a fairly explicit formula for the <f> n (x), 
by inverting the equati on to solv e for the T's. The form of the equation 
reminds us of the setup (2.6.11) for the Mobius inversion formula, but we 
have a product over divisors instead of a sum over divisors. A small dose of 
logarithms will convert products to sums, however, so we take the logarithm 
of both sides of (2.6.14), to get 


^ log $ d (x) = log (1 - x n ) 

d\n 


(n = 1,2,3,...). 


This is now precisely in the form 
it, the result being 


( 2 . 6 . 11 ) 


so we can use 


(2.6.12) 


to invert 


log 4> n (x) = log (1 “ X<1 ) ( n = 1, 2, 3, . . .). 


d\n 


Finally, we exponentiate both sides to obtain our “fairly explicit formula,” 

$n(x) = II( : 1 “ xd Y {n/d) (n = 1 , 2, 3, . . .). (2.6.15) 

d\n 


This is a good time to remember that the values of the Mobius function 
pi can only be ±1 or 0. So the exponents on the right side of (2.6.15) tell 
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us whether to omit a certain factor, which we do if [i = 0, to put it in the 
numerator (if // = 1), or to put it in the denominator (if // = —1). For 
instance, $12 (x) is 

(1 - xf (12) (l - x 2 ) m(6) (1 - x 3 f (4) (l - xY (3) (l - xY (2) (l - x 12 f (1) 

= (1 - x)°(l - x 2 ) l {l - x 3 )°(l - x 4 ) _1 (l - yyi - x 12 ) 1 
(1 — x 2 )(l — x 12 ) 

(1 — x 4 )(l — X 6 ) 

= 1 — X 2 + X 4 , 

which didn’t look much like a polynomial at all until the very last step! 

An important and beautiful fact about these polynomials is that the 
equation & n (z) = 0 can always be solved by radicals. That is, the solutions 
can always be obtained by a finite number of root extractions and rational 
operations. This is certainly not the case for general polynomial equations. 
As an example of this property we note the splendid fact that 

cos ^ = ^{-1 + Vl7 + ^2(17 — vT7) 

+ 2^17 + 3^17 - yj 2(17 - vTr) - 2\J 2(17 + vTr) 

The proof is fairly difficult, and can be found in Rademacher [Ra], 

Some applications of cyclotomic polynomials will appear in section 

4.10. 
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Exercises 

1. Calculate the first three coefficients of the reciprocals of the power series 
of the functions: 

(a) cos a; 

(b) (1 + x) m 

(c) 1 + t 2 + t 3 + t 5 + t 7 + f 11 + • • • 

2. Calculate the first three coefficients of the inverses of the power series 
for the functions: 

(a) sinx 

(b) tanx 

(c) x -b x 2 \J 1 + x 

(d) x + x 3 

(e) log(l -x) 

3. Let / be a formal power series such that f" + f = 0. Give a careful 
proof that f = A sin x + B cos x. 

4. Find simple closed formulas for the opsgf’s of the following sequences: 

(a) {n + 7 }q° 

(b) {l}f 

(c) {1,0, 1,0, 1,0, 1,0,...} 

(d) {l/(n+l)}r 

(e) {lj[n + 5) 1}q° 

(f) F l , 2 F 2 , 3 F 3 , 4 F 4 , . . . (the F ' s are the Fibonacci numbers) 

(g) {(n 2 + n + 1 )/n!}f° 

5. Use generating functions to prove that ())) = 2 n . 

6. Given positive integers n, k; define f(n, k ) as follows: for each way of 
writing n as an ordered sum of exactly k nonnegat-ive integers, let S be 
the product of those k integers. Then f(n,k ) is the sum of all of the S” s 
that are obtained in this way. Find the opsgf of / and an explicit, simple 
formula for it. 

7. Let /(n, k. h) be the number of ordered representations of n as a sum of 
exactly k integers, each of which is > h. Find Yh n /(ffi k, h)x n . 

8. Find the limit superior of each of the following sequences. In each case 
give a careful proof that your answer is correct. 

(a) 1,0, 1,0, 1,0,... 

(b) {(-1)"}§° 
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(c) {cos (■ mr/k)} n >o (fc ^ 0 is a fixed integer) 

(d) {l + ((-l)»A0}»>! 

(e) {n l / n } n >i 

9. Prove that if a sequence has a limit then its limit superior is equal to 
that limit. 

10. Prove that a sequence cannot have two distinct limits superior. 

11. Find the radius of convergence of each of the following power series: 


12 . 

13. 


(a) 

En> 1* 

n /{n 2 ) 

(b) 

1 + X 3 + X 6 + X 9 + x 12 + • 

(c) 

1 + hx 2 

+ 25x 4 + 125z 6 + 

(d) 

l + 2!x 2 

+ 4!x 4 + 6!x 6 + ■ 

(e) 

J2n>0 X 

77.! 


Finish the proof of 


theorem 2.4.1 


in the cases where R = 0 and R = oo. 


Show that if is a multiplicative function, then so is 


9( n ) = ^2f(d) (n = l,2, ...) 

d\n 


14. Euler’s function <f)(n ) is the number of integers 1 < m < n such that m 
is relatively prime to n. Show by a direct counting argument that 

^2<j)(d) = n (n = 1,2,...). 

d\n 


15. Show that each of the following functions is multiplicative. In each case 
find the value of the function when n is a prime power, and thereby find a 
formula for its value on any integer n. 

(a) Euler’s function </>(n) (use the results of problems 13, 14 above). 

(b) cr(n), which is the sum of the divisors of n. 

(c) The function |/x(n)|, which is 1 if n is not divisible by a square 
and 0 otherwise. 


16. For each of the functions defined in problem 15 above, find its Dirichlet 
series g enerating function by using Theorem 2.6.1. First substitute into 


(2.6.6) the values of the function at prime powers. Then try to sum the 


power series that occurs, in closed form. Finally, by comparing the product 
that results with 


(2.6.9) 


try to express your answer simply in terms of the 
Riemann zeta function. In each case the Dsgf can be simply expressed in 
terms of £(s), or small variations thereof. 


17. Find the Dsgf of each of the following sequences: 
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(a) Mr 

(b) M)r 

(c) {logn}r 


(d) {E d \ n d q }%L 1 

18. For each of the following identities: first check that the identity is 
sometimes correct by calculating both sides of the alleged equation when 
n = 1, 2, 3, 4, 5, 6, 7, 8; next find the Dsgf’s of the sequences on both sides 
of the claimed identity, observe that they are the same, and thereby prove 
the identity. Use the results of exercise 16 above. 

( a ) Ed\n^)= n ( n — 1) 

(b) Yd\n Md) = 0 if n > 2 and =1 if n = 1 

(c) Ys \nM<M0M) = 1 (n > 1) 


19. If {/(&)} i s the sequence in example 7 of section 2.2, show that f{k) = 
F 2k -i for k > 1, where the { F k } are the Fibonacci numbers. 


20. Prove the binomial theorem 




by comparing the coefficient of t n /n\ on both sides of the equation e t ^ x+y ' > = 
e tx e ty . Prove the multinomial theorem 


(xi H b x k ) n 


E 


n\ 


r iH h rk=n 


n! ■ • ■ r k \ 


rr ri . . . rr r k 

| X 1 X k 


by a similar device. 

21 . 

(a) Let T be a fixed set of nonnegative integers. Let /(n, k,T) be 
the number of ordered representations of n as a sum of k integers 
chosen from T. Find Yin f( n i ^ T)x n . 

(b) Let g(n, k, T ) be the number of ordered representations of n as a 
sum of k distinct integers chosen from T. Find Y n 9( n ’ k,T)x n . 

(c) Finally, let S. T be two fixed sets of nonnegative integers. Let 
/(n, k, S. T ) be the number of ordered representations of n as a 
sum of k integers chosen from T, each being chosen with a multi- 
plicity that belongs to S. Find Y n /( n ’ ^ T)x n . 

22. Let /(n) be the excess of the number of ordered representations of n 
as the sum of an even number of positive integers over those as a sum of 
an odd number of them. Find /(n) by finding Y n f( n ) xn and reading off 
its coefficients. 
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23. Let {B n } be the sequence of Bernoulli numbers defined by (2.5.8), and 
let m be a positive integer. By considering the generating function 

x(e mx - 1) 

e x — 1 


in two ways, find an evaluation of the sum of the rth powers of the first N 
positive integers as a polynomial of degree r + 1 in N, whose coefficients 
are given quite explicitly in terms of the Bernoulli numbers. 

24. 


(a) 

(b) 


Make a table of values of the classical Mobius function //(n) for 
n — 1,2,..., 30. 

Make a table of the values of the function /(n) of (2.6.13) for 
n = 1,2,..., 12. 


(c) Make a list of the primitive strings of length 6, and verify your 
value of /( 6). 


25. This problem is intended to show how generating functions occur in 
coding theory. An important question in coding theory is the following: for 
fixed integers n and d, what is the length A(n, d ) of the longest list of mbit 
strings of 0’s and l’s ( codewords ) such that two distinct codewords always 
differ in at least d bit positions? 

(a) Assign to each of the 2 n codewords (ei, . . . , e n ) a color , as follows: 
the color of e is )TL j e modulo 2 n. Show that if two codewords 
differ in just 1 or 2 coordinates, then they are assigned distinct 
colors in this scheme. 


(b) From part (a), show that A(n, 3) > 2 n /(2 n). 

(c) Let a,j be the number of codewords for which )L) r re r = j, for each 
j. Find the opsgf f(z) = )Tb ajz ? explicitly as a product. 

(d) If (3 r is the number of codewords of color r, then express /3 r in 
terms of the a ? ’s above. Then use the roots of unity method of 
section 2.4 to find that 


(3 r 


1 

2 n 


Y j 2 2a ^") e - 2 -^P- 

3 = 1 


for each r = 0, 1, . . . , 2n— 1. Here a and n" are defined by n = 2 a n" 
where n" is odd, and (6, c) denotes the g.c.d. of b and c. 

(e) Deduce that (3q is the largest of the /3 r ’s, and therefore find the 
stronger bound 


A(n, 3) > 


iy 2 2“o>") > 

2 n ' 

3= 1 


2 n 
2 n 
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(f) Use Parseval’s identity and the result of part (d) to find the vari- 
ance of the occupancy numbers /3 q, • • • , fan-i- Make an estimate 
that shows that the variance is in some sense very small, so that 
this coloring scheme is shown to distribute codewords into color 
classes very uniformly. 


26. Derive 


(2.5.7) from (2.5.6) 


That is, show that 


— n 
k 


= (-i y 


n + k — 1 
k 


27. Let D 


n) be the number of derangements of n letters, discussed in 


Example 4. 


(a) Find, in simple explicit form, the egf of {_D(n)}o°. 

(b) Prove, by any method, that 


D(n + 1) = (n + 1 )D(n) + (-l) n+1 (n > 0; D( 0) = 1) 


(c) Prove, by any method, that 

D(n + 1) = n(D(n) + D(n — 1)) (n > 1; D( 0) = 1; D{ 1) = 0). 


(c) Show that the number of permutations of n letters that have exactly 
1 fixed point differs from the number with no fixed points by ±1. 

(d) Let Dk(ri) be the number of permutations of n letters that have 
exactly k fixed points. Show that 


Dk ( n ) 

k,n>0 


x n y k 

n\ 


e ~x(i -y) 

1 — x 


28. Prove the following variation of the Mobius inversion formula. Let 
{a n (x)} and {b n (x)} be two sequences of functions that are connected by 
the relation 

a n(x) ='^2b™(x d ) (n= 1,2,3,...). 

d\n 

Then we have 


b n (x) = ^2y(^)a d (x n/d ) [n = 1,2,3,. . .). 


29. 

(a) Make a table of the values of (pin) for 1 < n < 25. 
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(b) As far as your table goes, verify that 0 is a multiplicative function, 
by actual computation. Then check by actual computation from your 
table, that the result stated in exercise 14 above is true when n = 20 
and n = 24. 

(c) Let n = p a where p is a prime number. What is 0(n)? 

(d) Use the results above to find a general formula for </>(n ) in terms of the 
prime factorization n = p^p^ 2 • • -p^ k of n. Use your result to calculate 
0(2592). 

(e) Find the Dirichlet series generating function of <f>(n), using (2.6.6), and 
express it in terms of the Riemann zeta function. 

(f) Apply the Mobius inversion formula to the result of exercise 18(a), and 
thereby “solve” 18(a) for (f>(n), to get an explicit formula for 0(n) that 
involves a sum of various values of the Mobius function. 

(g) Show that your answers to parts (d) and (f) of this problem are iden- 
tical, even though they look different. 


30. Find the Dirichlet series generating functions for the sequences 


(a) 

(b) 

(c) 



a n = \p(n)\, where // is the Mobius function. 


A number-theoretic function f(n) is strongly multiplicative if it is 
true that f(mn ) = f(m)f(n) for all pairs m,n of positive integers. 
Let A (n) be the strongly multiplicative function that takes the value 
— 1 on every prime, and A(l) = 1. Find its Dsgf, and then prove 


that 


£ *(<*) = 


1 if n is a square; 
0 otherwise. 


31. A Lambert series is a series of the form 


f( x ) = 


x 


n> 1 


X' 


and we then say that / is the Lambert series gf for the sequence {o n }. 
(Lambert series are only rarely used because they’re hard to analyze.) 

(a) Suppose / is the Lambert series gf of a sequence {a n }J°, and the 
same / is the opsgf of a sequence {fr n }i°- Find the b's in terms of 
the a’s. 
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(b) Thus prove the amazing identity 

v-t n{n)x n 


1 ~ x n 


where again /j, is the Mobius function. 

(c) Find the Lambert series generating function of Euler’s cf) function. 

32. Let a = {o n } n > o be a given sequence. Let S be the operator that 
transforms a into its sequence of partial sums: ( Sa) n — ao + ■ • • + a n , for 
n > 0. 

(a) If / is the opsgf of a, what is the opsgf of Sa? 

(b) If / is the opsgf of a and r > 0 what is the opsgf of S r al 

(c) What is S r a if a is the sequence of all l’s? 

(d) For a general sequence a, find an explicit formula, involving a single 
summation sign, for the nth member of the sequence S r a. 

(e) An unknown sequence a has the following property: if, beginning 
with a we iterate r times the operation S, of replacing the sequence 
by its sequence of partial sums, we obtain the sequence {1, 0, 0, . . .}. 
Find a. 


33. 


(a) Write out the first twelve cyclotomic polynomials. 

(b) If n = p a is a prime power, what is $ n (x)? 

(c) Show that for n > 1, 



34. Consider the following sequence of polynomials. 


ip n (x) = ^ xm (n = 1,2,...). 


1 < m < n 
gcd(m ,n) = l 


Thus Vq = x, -02 = x, V >3 = x + x 2 , -04 = x + x 3 , etc. 
(a) Show that 




d\n 


(b) Use the result of exercise 28| to show that 




d\n 
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(c) Show that at every primitive nth root of unity u we have ip n (w) = 
/i(n), and therefore the polynomial ip n (x) — fi{n ) is divisible by the 
nth cyclotomic polynomial Q n {x). 
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Chapter 3 

Cards, Decks, and Hands: The Exponential Formula 

3.1 Introduction 

In this chapter we will discuss a particularly rich vein of applications of 
the theory of generating functions to counting problems. The exponential 
formula, which is our main goal here, is a cornerstone of the art of count- 
ing. It deals with the question of counting structures that are built out of 
connected pieces. The structures themselves need not be connected, but 
tlieir pieces always are. The question is, if we know how many pieces of 
each size there are, how many structures of each size can we build out of 
those pieces? 

We begin with a little example. There is 1 connected labeled graph 
that has 1 vertex, there is 1 connected labeled graph that has 2 vertices, and 
there are 4 connected labeled graphs that have 3 vertices. These creatures 
are all shown in Fig. 3.1 below. 


1 2 


2 13 


12 3 


13 2 


A 


Fig. 3.1: The six labeled, connected graphs of < 3 vertices. 

Now think of graphs that have exactly 3 labeled v ertices bu t are not 


necessarily connected. There are 8 of them, as shown in Fig. 3.2 


The question is, how can we develop a theory that will show us the 
connection between the number 8, of all graphs of < 3 vertices, and the 
numbers 1, 1, 4 of connected labeled graphs of 1, 2, and 3 vertices? After all, 
such a theory should exist, because the connected graphs are the building 
blocks out of which all graphs are constructed. How, exactly, are those 
building blocks used? 

Suppose we want to construct a graph G of n vertices and k connected 
components. We can first choose which k connected graphs to use for the 
connected components, subject only to the condition that the sum of their 
numbers of vertices must be n. 

Second, after deciding which connected graphs to use, we need to re- 
label all of their vertices. That is because the connected graphs that we 
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1 


2 3 

• • 


12 3 

■ ■ • 


13 2 

* • • 


2 13 


12 3 



2 3 1 

> • • 

13 2 


Fig. 3.2: The eight not-necessarily-connected labeled graphs of 3 vertices. 


use, as m 


Fig. 3.1 


each have their own private sets of vertex labels. A 
connected graph of 5 vertices will have labels 1, 2, 3, 4, 5 on its vertices, 
etc. However, the final assembled graph G. that we are manufacturing out- 
of those connected pieces, will use each vertex label 1, 2, 3, . . . , n exactly 
once, as in Fig. 3.2. Note, for instance, that the connected graph of 1 
vertex appears three times in the first graph of Fig. 3.2 with 3 different 
vertex labels. 

So our counting theory will have to take into account the choices of the 
connected graphs that are used as building blocks, as well as the number 
of ways to relabel the vertices of those connected graphs to obtain the final 
product. 

Now we’re going to raise the ante. Instead of going ahead and an- 
swering these counting questions in the case of graphs, it turns out to be 
better to be a bit more general right from the start, because a lot of nice 
applications don’t quite fall under the heading of graphs. So we are going 
to develop the theory in a context of ‘playing cards’ and ‘hands,’ instead 
of ‘connected graphs’ and ‘all graphs.’ 

Next you will see a number of definitions of the basic terminology. 
After all of those definitions, a few examples will no doubt be welcome, and 
will be immediately forthcoming. Then we will get on with the development 
of the theory and its numerous applications. 


3.2 Definitions and a question 

We suppose that there is given an abstract set P of ‘pictures.’ 

Definition. A card C(S,p ) is a pair consisting of a finite set S (the ‘label 
set’) of positive integers, and a picture p G P. The weight of C is n = IS). 
A card of weight n is called standard if its label set is [n].* 


* Recall that [n] is the set {1, 2, . . . , n}. 
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Definition. A hand H is a set of cards whose label sets form a partition 
of [n], for some n. 

This means that if n denotes the sum of the weights of the cards in the 
hand, then the label sets of the cards in H are pairwise disjoint, nonempty, 
and their union is [n\. 

Definition. The weight of a hand is the sum of the weights of the cards 
in the hand. 

Definition. A relabeling of a card C(S,p ) with a set S' is defined if |Sj = 

1 5" | , and it is the card C(S',p). If S' = [|Sj] then we have the standard 
relabeling of the card. 

Definition. A deck D is a finite set of standard cards whose weights are 
all the same and whose pictures are all different. The weight of the deck is 
the common weight of all of the cards in the deck. 

Definition. An exponential family T is a collection of decks Tfi, V 2 , . . . 
where for each n = 1 , 2 ,..., the deck T> n is of weight n. 

If J- is an exponential family, we will write d n for the number of cards 
in deck V n , and we will call T>(x), the egf of the sequence {d n }i°, the deck 
enumerator of the family. 

Question: Given an exponential family T . For each n > 0 and k > 1, let 
h(n , k) denote the number of hands H of weight n that consist of k cards, 
and are such that each card in the hand is a relabeling of some card in 
some deck in T . Repetitions are allowed. That is, we are permitted to take 
several copies of the same card from one deck, and to relabel those copies 
with different label sets. 

How can we express h(n,k ) in terms of di,d 2 ,ds, . . ., where di is the 
number of different cards in deck T>i (i > 1)? 

If h(n, k ) is the number of hands H of weight n that have exactly k 
cards, then we introduce the 2-variable generating function 

n(x,y)= h(n,k)^-y k . (3.2.1) 

n\ 

n,k> 0 


This is a generator of mixed type; it is an opsgf with respect to the y 
variable and an egf with respect to x. We will call it the 2-variable hand 
enumerator of the family. 

If h(n ) = h(n, k ) is the number of hands of weight n without regard 
to the number of cards in it, then we write 7t(x) for the egf of {h(n)}, 
instead of 7i[x , 1). It is the 1-variable hand enumerator of T . 

One way to answer the question raised above would be to exhibit a 
simple relationship between the generating functions H(x,y) and V{x), 
and that, of course, is exactly what we are about to do (see 
for a look at the answer). 


(3.4.4) below 
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3.3 Examples of exponential families 

Before we get on with the business of answering the question that 
was raised in the previous section, here are a few examples of exponential 
families that have important roles in combinatorial theory. 


Example 1. 

The first exponential family that we will describe is the family of all 
vertex-labeled, undirected graphs. We will call this family T\. 

A graph G is a set of vertices some pairs of which are designated as 
edges. A labeled graph is a graph that has a positive integer associated 
with each vertex. The integers (‘labels’) are all different. The graph has 
the standard labeling if the set of its vertex labels is [n], where n is the 
number of vertices of G. 

There are (") possible edges in graphs of n vertices, so there are 2^ 2 ) 
labeled graphs of n vertices. F or instanc e, there are 8 labeled graphs of 3 
vertices, and these are shown in Fig. 3.2 (graphs are drawn by first drawing 
the n vertices and then, between each pair of vertices that is designated as 
an edge, drawing a line). 

Some graphs are connected and some are disconnected. A graph is con- 
nected if, given any pair of vertices, we can walk from one to the other along 
edges in the drawing of the graph. Ot herwise, t he graph is disconnected. 
Of the 8 graphs of 3 vertices, shown in Fig. 3.2 4 are connected, namely 
the last 4 that are pictured there. 

Now let’s describe our exponential family. 

First, we describe a card C(S,p). There is a card corresponding to 
every connected labeled graph G. The set S is the set of vertex labels that 
is used in the graph. 

Before we can describe the ‘picture’ on the card we need to say what 
a standard relabeling of a graph is. Let G be a graph of n vertices that are 
labeled with a set S of labels. Then relabel the vertices with [n], preserving 
the order of the labels. That is, the vertex that had the smallest label in 
S will then get label 1, etc. Therefore the standard relabeling is uniquely 
defined. 

Now, if G is a labeled, connected graph, the picture p on the card 
C(S,p ) that corresponds to G is the standard relabeling of G. Hence, on 
a card C we see two things: a picture of a connected graph with standard 
labels, and another set of labels, of equal cardinality. 

For instance, one card of weight 3 might be 


(S,p) = ({5,9, ) 

which would correspond to the connected labeled graph 

5 11 9 
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So cards correspond to connected graphs with not-necessarily-standard 
label sets. 

What is a hand? A hand is a collection of cards whose label sets 
partition [n\, where n is the weight of the hand, which is to say, it is the 
total number of vertices in all of the connected graphs on all of the cards 
of the hand. But that is something very useful; a hand H corresponds 
to a not-necessarily-connected graph with standard labels! Its individual 
connected components may have nonstandard labels, but the graph itself 
uses exactly the labels 1,2 , ,n, where n is its number of vertices. 

In summary then, the set of all vertex labeled graphs forms an expo- 
nential family. Each card is a labeled connected graph, each deck T> n is 
the set of all connected standard labeled graphs of n vertices, each hand 
is a standard (not-necessarily-connected) labeled graph. The number d n of 
cards in the nth deck is the number of standard connected labeled graphs 
of n vertices, and the number /z(n, k) of hands of weight n with k cards 
is the number of standard labeled graphs of n vertices with k connected 
components. The question posed at the end of the last section in this case 
asks for the relationship between the numbers of all labeled graphs and all 
connected labeled graphs of all sizes. | 

Example 2. 

In this example we will find that the set of all permutations can be 
thought of as an exponential family. 

First let’s say what the cards are. On a card, the picture will show n 
points arranged in a circle, the points being labeled with the set [n ] , in some 
order, and there will be arrowheads around the circle, all pointing clockwise, 
to tell us that the points are arranged in clockwise circular sequence. 

So much for the ‘picture’ part of the card. Additionally, there is a set 
S of n positive integers on the card. 

The reader will recognize that such a card corresponds to a cyclic 
permutation of the elements of S, i.e., a permutation of S that has a single 
cycle. For instance, the card whose picture is shown in Fig. 3.3 


4 

Fig. 3.3: A cyclic permutation is in the cards. 

and whose set is S = {2, 4, 7, 9, 10} represents the cyclic permutation 

2 — >7 — >4 — >10 — >9 — >2 



of the set S. 
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Now what is a deck of these cards? The cards in a deck are standard 
cards, and they consist of one sample of every distinct standard card of a 
given weight. In this case the nth deck D n contains exactly (n — 1)! cards, 
one for each cyclic permutation of [n]. 

So far we have accounted for the permutations with one cycle. They 
are the building blocks out of which all permutations are constructed, using 
hands of cards. 

So what is a hand , in this example? A hand is a collection of cards, 
and on each card there are two things: a cyclic permutation and a label 
set. The label sets are pairwise disjoint and their union is {1,2, ...,n}. 
The cardinality of the label set on each card matches that of the cyclic 
permutation that is shown there. The collection of all of the cards in the 
hand represents a permutation of n letters. The cycles of this permutation 
are the ones shown on the individual cards of the hand after the cycle on 
each card has been relabeled, in an order-preserving way, with the elements 
of the label set on the card. 

Since every permutation of n letters has a unique decomposition into 
cycles, we see that hands of weight n correspond exactly to permutations of 
n letters. 

Hence the set of all permutations is an exponential family. We call it 
Pi- 

How many cards are in deck T> n l There are d n = (n — 1)! of them. The 
question raised at the end of the last section asks for the number h(n. k ) 
of hands of weight n and k cards. Such a hand represents a permutation 
of n letters that has k cycles. Hence in this case h(n , k ) is the number of 
permutations of n letters that have k cycles. When we have our general 
theorems in place, the ones that give the relationships between the d n ’s 
and the h(n , A'j's, we’ll learn a lot about permutations of various kinds with 
given numbers and sizes of cycles. Later, in chapter 5, we’ll return to this 
subject and re-use these generating functions to get asymptotic information 
about permutations and their cycles. 

3.4 The main counting theorems 

In this section we will state and prove various forms of the exponential 
formula. The next section contains 10 9 applications of the method. 

First, let two exponential families be given. We will say what it means 
to merge them. Roughly, it means to form a new family whose decks of 
each weight are the unions of the decks of those weights in the two given 
families. Some care is necessary, however, to insure that the two decks have 
all different cards, so we will now give a precise definition. 

Let T' and T" be two exponential families whose picture sets P' . P" 
are disjoint. We form a third family T . and write T = T' © T " , as follows: 
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fix n > 1. From F' we take all of the d' n cards of deck T>' n and put them 
in a new pile. Then from F" we take all d" t of its cards from deck F" and 
add these d" cards to the pile, which now contains d n = d' n + d" different 
cards. Repeat this for each n > 1. 

The Ftindamental Lemma of Labeled Counting. Let F' , F" be two 

exponential families, and let F = F' © F" be their merger. Further, let 
1 i'(x,y), H"(x,y), H(x,y) be the respective 2-variable hand enumerators 
of these families. Then 


Ti{x,y) =n'(x,y)n"{x,y). 

Proof. Consider a hand H in the merged family F. Some of its cards 
came from F’ and some came from F" . The collection of cards that came 
from F’ forms a sub-hand H’ of weight, say, n' , and having k' cards, t-hat- 
has been relabeled, in an order-preserving way, with a certain label set 
S C [n]. All hands H in the merged family are uniquely determined by a 
particular hand H' from F’ , the choice of new labels S with which t-hat- 
hand is to be relabeled, and the remaining subhand hi" from F" , which 
must be relabeled, again preserving the order of the labels, with [n] — S. 

Consequently the number of hands in the merged family that have 
weight n and have exactly k cards is 


h(n , k ) 


y] ( U / j h'(n f , k’)h" (n — n , k — k ') 

n',k’ ^ ' 


X j. 

-ry k 

n\ 




(3.4.1) 


and we are finished. ■ 

The main idea is that the processes of merging families and of mul- 
tiplying egf’s correspond exactly. The fact that in equation (3.4.1) the n' 
variable in the sum carries a binomial coefficient along in its wake, while 
the k' does not, accounts for the mixed nature of the generating function 
that was chosen, with the ‘ad variable being egf-like and the 'if variable 
ops- like. 

The Fundamental Lemma will allow us to build up the general rela- 
tionship between deck and hand enumerators very easily, in a ‘Sorcerer’s 
Apprentice’ fashion, beginning with a trickle and ending with a flood. We 
begin with a starkly simple exponential family that consists of exactly one 
nonempty deck that has just one card in it. The hand enumerator there 
will be obvious. Then we consider a family that has a number of cards in 
one deck, and no other decks. Finally we jump to the general situation, at- 
each stage using the Fundamental Lemma, because we will be carrying out- 
a merging operation. 
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Step 1: The trickle. 

Fix a positive integer r. Let the rth deck, T> r . contain exactly one card, 
and let all other decks be empty. The deck counts are d r = 1 and all other 
dj = 0. The deck enumerator is T>(x ) = x r /r\. A hand H consists of some 
number, say s, of copies of the one card that exists. The weight of H is rs. 
Therefore the number of hands of k cards and of weight n is h(n. k) = 0 
unless n = kr. If n = kr, then how many hands of weight n are there? We 
can choose the labels for the first card in ways, for the second in ( n ~ r ) 
ways, etc, for the kth card in ( n- ^ -1 ' )r ) = 1 way. Since the order of the 
labeled cards is immaterial, the number of hands is therefore 

h (kr,k) = ^. 

The hand enumerator of this elementary family is therefore 


H(x,y) = ^ h(n, k)x n y k /n\ 

n,k 

r r kr 1 .k 

E x V 

k\ r \k 

k 



(3.4.2) 


the 


We won't have to do any more computation to get the general result; 


Fundamental Lemma 


will do it for us. 


Step 2: The flow 

Fix positive integers r and d r , and consider an exponential family T 
that has d T cards in its rth deck T) r . and has no other nonempty decks. We 
claim that the hand enumerator of this family is 

H{x, y) = exp | yd ^ j. (3.4.3) 


The proof is by induction on d r . The claim is correct when d r = 1, for 
that is (3.4.2). Suppose the claim is true for d r = 1, 2, . . . , m — 1, and let 
the family J~ have m cards in its rth deck. Then J~ is the result of merging 
a family with m—1 cards in the rth deck and a family with 1 car d in that- 


deck. By the inductive hypothesis and the 
enumerator is the product 


Fundamental Lemma 


the hand 


exp {y(m — l)x r /r!} exp {yx r /r!} = exp {ymx r /r\} , 
and the claim is proved. 


Step 3: The flood. 

We are now ready to prove the main counting theorem. 


3.5 Permutations and their cycles 
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Theorem 3.4.1 (The exponential formula). Let T be an exponential 
family whose deck and hand enumerators are V(x) and 7f(x. y). respec- 
tively. Then 

H(x,y) = e yV(x l (3.4.4) 

In detail, the number of hands of weight n and k cards is 


h(n , k ) = 


n 


X 

n\ 


v( x y 

~kT 


(3.4.5) 


Proof. In (3.4.3) we have proved this result in the special case where 
there is only one nonempty deck. But a general exponential family with 
a full sequence of nonempty decks T>i,T> 2 , ■ ■ ■ is the merger of the special 
families T r [r = 1,2,...), each of which has just a single nonempty deck V r 
By the 


Fundamental Lemma 


the hand enumerator of the general family 
is the product of the hand enumerators of the special families. But the 

.4) claimed in the theorem is indeed the product of 
of the special families J- r , and the proof is finished. 


generating funct ion (3. 


the enumerators (3.4.3) 


By summing (3.4.5) over all k we obtain the following: 

Corollary 3.4.1. Let T be an exponential family, let 'D(x) be the egf of 
the sequence {d n }i° of sizes of the decks, and let H(x) {^n)o° > where 
h n is the number of hands of weight n. Then 

U(x)=e D{x) . (3.4.6) 

By summing (3.4.5) over just those k that lie in a given set T, we 
obtain 

Corollary 3.4.2 (The exponential formula with numbers of cards 
restricted). Let T be a set of positive integers, let erix) = '^2 neT x n /n\, 
and let h n {T ) be the number of hands whose weight is n and whose number 
of cards belongs to the allowable set T . Then 

{MT)}S° VtL er(V(x)). (3.4.7) 


The next several sections of this chapter will contain applications of 
the exponential formula. 


3.5 Permutations and their cycles 

We apply the theorems to the exponential family JF 2 of permutations, 


that was described in example 2 of section 3.3 There we observed that the 
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deck V n contains d n = (n ■ 1 )! cards. The exponential generating function 
of the sequence {(n — l)!}f° is 


v ( x ) = 

n> 1 

= E 


n 


X 

n\ 


n> 1 


X 

n 


= log 


1 — x 


Now from 


theorem 3.4.1 


we have 


H(x,y) = exp y log 


1 — x 


(1 — x)t 


(3.5.1) 


In this exponential family, h(n , k ) is the number of permutations of 77- 
letters that have k cycles, and it is called the Stirling number of the first 
kind. We will use one of the standard notations, [£]*, for these numbers, 
and will reserve the h(n, k) for the general situation. 

Now, 


E 


77 


x n 

A 

y = 

n\ 


(1 — x) y 


-f + : (by (2.5.7)) 


(3.5.2) 


= y(y + 1) ■ ■ ■ {y + n - 1), 


so the numbers of permutations of n letters with various numbers of cycles 
are the coefficients in the expansion of the ‘rising factorial’ function y(y + 
1) • • ■ (y + n - 1). 

The enumerator of hands of k cards is obviously 


1 

k\ 



1 


1 — x 


k 


(k = 1,2,...), 


which tells us that the Stirling number is also given by 


77 

k 


x n 1 
77! k\ 



1 


1 — x 


k 


(3.5.3) 


* There are as many notations for ["] as there are books on combina- 
torics. It is called (—l) k s(n, k) or 51 ( 77 , k), or s(n,k), or c(n,k), or several 
other things. Similarly the { } are called 82 ( 77 , k ) or S(n,k ), etc. 


3.7 A subclass of permutations 
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One thing that we don’t find is a simple little formula for these Stirling 
numbers. One can find formulas for them, but they’re fairly unpleasant, 
involving double sums of summands with sign alternations, etc. But with 
the generating function apparatus we can do just about whatever we want 
to without such a formula. To calculate numerical values of the [('] , for in- 
stance, one can use the very simple r ecurrence re lations that can be derived 
from these generating functions (see 


Exercise 8 


3.6 Set partitions 

We introduce a new exponential family J ~:> , as follows: first, for each 
n > 1, in the deck V n there is just one card of weight n. On that card 
there is a picture of a smiling rabbit,* and there is the label set [n]. 

What is a hand? There is a hand H corresponding to every partition 
of the set [n]. Indeed, given such a partition, take the sets in it and let 
them relabel the label sets on the cards in the hand. Then the cards are 
otherwise uniquely determined since there’s only one card of each weight. 

So in this exponential family the number of hands of weight n that have 
k cards is equal to the number of partitions of the set [n] into k classes. 


But that is something we’ve met before, in example 6 of chapter 1, where 
we called those numbers the Stirling numbers of the second kind. 

To apply the exponential formula we first compute the egf of the num- 
bers d n of cards in each deck. But these numbers are all 1, if n > 1, and 
are 0 else, so 


®w = E^ = E^-i- 


n> 1 


Now by the exponential formula the enumerator of hands is 

H(x,y) = e y(e * _1) , 

and in particular 


J n \ _ 

x n 

f («*- 

UJ 

n\ 

l k\ 


( 3 . 6 . 1 ) 


( 3 . 6 . 2 ) 


Compare this result with the generating function (1.6.12) of the Bell num- 
bers and find that we have here a refinement of that generating function. 
Not only does e e _1 generate the numbers of partitions of n-sets. but each 
term of the expansion 


W-i 


E 

fc> o 


iy 


k\ 


has significance with respect to the numbers of classes in the partitions. 

* Why not? Since there’s only one card the picture is immaterial, so it 
might as well be cheerful. 
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3.7 A subclass of permutations 

How many permutations a of n letters have the property that a has 
an even number of cycles and all of them are of odd lengths? 

This problem takes place in an exponential family that is like the family 
J ~ 2 of permutations, except that it contains only the decks of odd weights, 

Vi,V 3 , The numbers {d n }i° that count the cards in the decks are now 

1, 0, 2, 0, 24, 0, 720, — The egf of the deck counts is 


»(*)= E (»-d'£ 

n odd 

2r + 1 


= £ 


= log 


1 + X 


X 


by 


(2.5.2), 

Since 


the number of cycles is required to be even, the allowab 


of cards in a hand are the set T=the even numbers. By 
the answer is 


(3.4.7) 


e numbers 
the egf of 


cosh 



1 + x 


x 


1 

\/l — x 2 



{x/2) 2m . 


The number of permutations that meet the conditions of the problem is the 
coefficient of x n /n\ here, namely 

/ n\ n\ 

That’s one way to answer the question, but the answer can be restated 
in quite a striking form, like this- 

Theorem 3.7.1. Let a positive integer n be fixed. The probabilities of 
the following two events are equal: 

(a) a permutation is chosen at random from among those of n letters, 
and it has an even number of cycles, all of whose lengths are odd 

(b) a coin is tossed n times and exactly n/2 heads occur. 


3.8 Involutions, etc. 

Fix positive integers m,n. How many permutations o, of n letters, 
satisfy o m = 1, where ‘F is the identity permutation? 

To do this problem, we need the following: 


3.9 2-regular Graphs 
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Lemma. For a m = 1 it is necessary and sufficient that all of the cycle 
lengths of a he divisors of m. 

Proof. Consider a cycle C of a, of length r. Let i be some letter that 
is in C. Then, by definition of a cycle, a m (i ) is the letter on C that we 
encounter by beginning at i and moving m steps around the cycle, namely 
the letter that is to mod r steps around C from i. But a m (i) = i ■ Therefore 
to mod r = 0, i.e. , r divides to. Therefore to is a multiple of the length of 
every cycle of C. The converse is clear, and the proof is finished. | 

Now back to the problem. Consider the exponential family J~\ in which 
the cards are the usual ones for cycles of permutations, but in which the 
only decks that occur are those whose weights are divisors of to. Then 
d r = (r — 1)! if r\m, and is 0 else. Hence 


V{x) = ^ d r x r /r\ 

r> 1 


E 



(3.8.1) 


By the exponential formula 
result: 


(theorem 3.4.1) we have the following elegant 


Theorem 3.8.1. Fix to > 0. The numbers of permutations of n letters 
whose mth power is the identity permutation have the generating function 


exp 



(3.8.2) 


Let’s try a special case of this theorem. Take to = 2. Then we are 
talking about permutations whose square is 1. These are called involutions. 
Involutions can have cycles of lengths 1 or 2 only, by the lemma above. If 
t n is the number of involutions of n letters, then by (3.8.2) we have 


E 

n> 0 


-X 


n\ 


= e 


x+hx 2 


(3.8.3) 


3.9 2-regular Graphs 

How many undirected, labeled graphs are there on n vertices, in which 
every vertex is of degree 2 (such graphs are called 2-regular )? 

Such a graph is a disjoint union of undirected cycles, so we have an ex- 
ponential family JF 5 in which the cards stand for undirected cycles, instead 
of directed ones, as in the case of permutations. 

For fixed n < 2 there are no undirected cycles at all. For n > 3, the 
number d n of cards in the nth deck is the number of undirected circular 
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arrangements of n letters, and that number is (n — l)!/2. Therefore the 
generating function of the deck sizes is 


(n — 1)! 


2 n\ 


v i x ) = Y 

n> 3 


-x 


n 


n> 3 


1 


= o { lo § 


1 


1 


X 


X 


X 

y 


By the exponential formula (3.4.4), the exponential generating function of 
the number g{n ) of undirected 2-regular labeled graphs is 


Y 9 ^ 


i>0 


x 


n\ 


= exp 


- log — 
2 6 1 - 


x 


2 4 


-± x -l x 2 

£> 2 ^ 4 


\/l — x 


(3.9.1) 


This answer is a sparkling example of the ability of the generating function 
method to produce answers to difficult counting problems with minimal 
effort. 


3.10 Counting connected graphs 

How many labeled, connected graphs of n vertices are there? 

Now we’re back in the exponential family T\ of labeled graphs, but 
there are one or two little twists. The exponential formula can tell you the 
number of all gadgets of each size if you know the number of connected 
ones, or vice versa. This problem is ‘vice versa.’ The number of all labeled 
graphs of n vertices is 2^ 2 \ so in the equation ‘Hands =e Decks ’ we know 
‘Hands’ and we want to find ‘Decks,’ rather than the other way around. 

There’s one more twist. Let V(x) and 7i(x) be the egf’s of the decks 
and the hands, respectively. Then 

v, 2© 

n M = 5? 

n>0 


and this series does not converge for any x ^ 0. So this is a formal power 
series generating function only, and we should not expect analytic functions 
at the end of the road. 

Having said all of that, the machinery still works very nicely. We 
will now find a recurr ence formula for the number of connected graphs by 
the ‘x-Dlog’ method of section 1.6. It isn’t any harder to find a general 
recurrence relation than for this special case, however, so let’s do it in 
general. 


3.11 Counting labeled bipartite graphs 
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Theorem 3.10.1. The counting sequences {d n } and {h n }, of decks and 
hands in an exponential family satisfy the recurrence 


nhn = l^jkdkhn-k (n>l;h 0 = l). 


(3.10.1) 


(3.4.6) 


Proof. Apply the 'xD log ’ method 


of section 1.6 to the exponential formula 


It follows that the numbers d n of connected labeled graphs of n vertices 
satisfy the recurrence 

77 , 2 ( 2 ) = ^ (^j kd k 2("' fc ) (n > 1). (3.10.2) 


From this formula we are able, for example, to compute the d n ‘s for small 
n. For 77 = 1, . . . , 6 we find the values 1, 1, 4, 38, 728, 26704. 


3.11 Counting labeled bipartite graphs 

How many bipartite vertex-labeled graphs of n vertices are there? 

The exponential formula can handle even this problem with just a little 
bit of coaxing. A bipartite graph G is a graph whose vertex set V ( G ) can 
be partitioned into V = A U B such that every edge of G is of the form 
(a, b ), where a G A and B. A bipartite graph of 10 vertices is shown in 
Fig. 3.4. 



Fig. 3.4: A bipartite graph 

Now, of the 2 ( 2 ) labeled graphs of n vertices, how many are bipartite? 

Well, there’s a little problem. The exponential formula can count the 
hands if you can count the decks, or it can count the decks if you can count 
the hands. But it can’t do both, and in this problem it isn’t immediately 
clear how many connected bipartite graphs there are or how many there 
are altogether. 

A thought might be to choose the sets A, B of the partition [ 77 ] = AUB, 
and then count the bipartite graphs that have that partition. The latter 
is easy; since there are |A||5| possible edges, there must be 2 lAl s l ways to 
exercise the freedom to draw or not to draw all of those edges. 
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The problem is that a fixed bipartite graph might get counted several 
times in the process. In other words, there may be several ways to exhibit 
a partition of the vertex set with all edges ru nning betwe en vertices in 
different classes. For instance, the graph G of Fig. 3.4 would turn up 
several times: once with A = {1, 4, 6 , 9}, again with A = ( 2 , 3, 5, 7, 8 , 10}, 
again with A = {1, 4, 5, 9}, etc. 

In general, a bipartite graph that has c connected components would 
be created 2 C times by the construction that we are considering, the reason 
being that for each connected component Gi of G we can choose which of 
the two sets in its vertex partition, Ai or Bi , will get put on the left hand 
side, in A , and which on the right hand side, in B. 

To get around this conundrum we use slightly different playing cards. 
By a 2-colored bipartite graph we mean a vertex-labeled bipartite graph G 
together with a coloring of the vertices of G in two colors (‘Red,’ ‘Green’), 
such that whenever (v. w) is an edge of G. then v and w have different- 
colors. 


A connected bipartite graph, for instance, creates two 2-colored graphs. 
A bipartite graph with c connected components creates 2 C such 2-colored 
graphs. 

In the exponential family JF 6 that we are making, there will be a card 
C corresponding to each 2-colored connected labeled bipartite graph. Im- 
printed on the card there will be, as always, S , the set of vertex labels 
that are used, and a picture of a 2-colored, connected bipartite graph of |*S'| 
vertices with standard vertex labels. 

What have we gained by coloring the cards? Just this: we now know 
how many hands of weight n there are. That number is 



(3.11.2) 


because each and every hand arises exactly once from the following con- 
struction: 

(i) fix an integer /c, 0 < k < n. 

(ii) choose k of the elements of [n] and color them ‘Red.’ 

(iii) color the remaining elements of [n] ‘Green.’ 

(iv) decide independently for each vertex pair (p, 7 ), where p is Red 
and 7 is Green, whether or not to make (p, 7 ) an edge. 

It is obvious that (3.11.2) counts the possible outcomes of the con- 
struction. 

So, even though we are in the wrong exponential family, because things 
are colored that we wish weren’t, at least we know how many hands there 
are! 

Next, let’s use the exponential formula to find the egf for the decks, 
which correspond to connected 2-colored bipartite graphs. It tells us in- 
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stantly that 


P(x) = log|^^x n | 

^n > 0 ' ) 


(3.11.3) 


where j n is defined by 
Now that we have 


(3.11.2) 

the connected colored graphs counted, is it hard to 
count the connected uncolored graphs? Not at all, because there are just 
half as many uncolored and connected as there are colored and connected. 
So the egf of ordinary, uncolored connected bipartite graphs is V(x)/2 , 
where T>(x) is given by (3.11.3). 

But now we have achieved, in the correct exponential family, the ob- 
jective that we had not reached before: we know how many cards there are 
in each deck. So we know one of the two items that the exponential formula 
relates, and therefore we can find the other one. 

Since V(x)/2 generates the deck counts, it must be that 


3 ®(*)/2 _ 


exp {5 log {£|b n } 

^ k n>0 J 


'£>’ 


(3.11.4) 


n> 0 


generates the hand counts, and we have: 

Theorem 3.11.1. Let /3(n ) denote the number of vertex labeled bipartite 
graphs of n vertices. Then 


£ 

n> 0 


Pin) 


x n = 


'£> 


n\ 


(3.11.5) 


n> 0 


where the y n are given by (3.11.2) 


So all of the complications about multiple counting were resolved by 
taking the square root of the generating function that we started with! 


3.12 Counting labeled trees 

A tree is a connected graph that has no cycles. How many (standard) 
labeled trees of n vertices are there? 

In this example we will derive the answer to that question in the form 
of one of the most famous results in combinatorics, namely: 

Theorem 3.12.1. For each n > 1 there are exactly n n ~ 2 labeled trees of 
n vertices. 

Although many proofs are known, the one by generating functions, 
which uses the exponential formula, is particularly enchanting, and here it 
is: 
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A rooted tree is a tree that has a distinguished vertex called the root. 
There are obviously n times as many labeled rooted trees of n vertices as 
there are trees, so we will be finished if we can count the rooted ones. 

Let t n be the number of rooted trees (with standard labels) of n vertices 
for n > 1. We define an exponential family F-j as follows. The cards 
correspond to rooted labeled trees. On a card C(S,p), p is a picture of a 
standard rooted tree of (S') vertices, and S is a set of labels. 

In what is a hand? A hand H corresponds to a rooted labeled 
forest , which is a labeled graph each of whose connected components is a 
rooted tree. The exponential formula will tell us how many forests there 
are if we know how many trees there are, or vice versa. But this is one of 
those unsettling situations where we know neither. The solution? Press on, 
and keep the faith. 

By the exponential formula, 

H(x) = e v{x \ (3.12.1) 

where H(x) FL {/ n }, F>(x) ??£ {f n } and f n is the number of rooted forests 
of n vertices. Now (3.12.1) is one equation in two unknown functions. To 
get another one we use a fact that was discovered by Polya, namely that 

t T i+i = (n + 1 )/ n (n > 0). (3.12.2) 

To prove (3.12.2), let F be a rooted labeled forest of n vertices. In- 
troduce a new vertex v. and assign to it a label j, where 1 < j < n + 1. 
Relabel F with the set 1, 2, . . . , j — 1, j + 1, . . . ,n + 1, preserving the or- 
der of the labels. Then draw edges between v and all of the roots of the 
components of F, and root the resulting tree at v. The result is a rooted 
labeled tree of n + 1 vertices. As we vary the label j. we construct n + 1 
rooted trees corresponding to each rooted forest F. The construction is 
easily reversible, so every rooted tree of n + 1 vertices occurs exactly once, 
which proves (3.12.2). g 

The sequence f n = t n +i/(n + 1) has the egf 

H{x) = — \x n 

^ n\ 

n> 0 

= Y-h±l- x " 

^ (n + 1)! 

n > o v ' 

= -V(x). 

X 

If we combine this with (3.12.1) we get 

V{x) = xe D{x \ (3.12.3) 
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Now, in previous problems where there was an unknown generating 
function it has always happened that we obtained some sort of functional 
equation that had to be solved in order to find the function. We have seen 
situations where the equation was a diffe rential equation, and others where 
it was a quadratic equation. In (3.12.3) we have a functional equation that 
is to be solved for T){x ), which in fact determines 'D(x) uniquely, but which 
is not a differential equation or an algebraic equation, and whose solution 
isn’t obvious at all. 

There is a powerful tool for dealing with t his kind of a functional 
equation, called the Lagrange Inversion Formula, which will be discussed 


in section 5.1. There we will finish the enumeration of trees as an illustration 
of the use of the Lagrange formula. 


3.13 Exponential families and polynomials of ‘binomial type.’ 

Associated with each exponential family there is a sequence of polyno- 
mials 

K{v) = ^h{n,k)y k (n = 0,1,2,...), (3.13.1) 

fc 

where h(n, k ) is the num ber of h ands of weight n and k cards. In view 
of the exponential formula (3.4.4) these polynomials satisfy the generating 
relation 


e yV(x) _ 

Polynomial sequences that satisfy 
binomial type by Rota and Mullin 
since 


E 

n> 0 


t(y) 


x n 

n\ 


(3.13.2) 


3. 13.2) have been called polynomials of 


[RM] 


The reason for the name is that 


e uV {x) e W( X ) { 0 n ( w)}; 


it- follows that 


(f n (u + v) = \^](j) r (u)(j) n - r (v) (n> 0), 

which is reminiscent of the binomial theorem. 

Although various authors have given combinatorial interpretations for 
such polynomial sequences, the very natural interpretation that appears 
above seems not to have been discussed. That interpretation is: when 
the coefficients of polynomials {4> n (y)} of binomial type are nonnegative, 
then there exists an exponential family T such that for each n > 0, 4> n (y) 
generates the hands of weight n, by numbers of cards. Conversely, every 
exponential family has a family of polynomials of binomial type associated 
with it. 
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3.14 Unlabeled cards and hands 

In the remainder of this chapter we will consider the same kinds of 
problems, except that there will be no label sets to worry about. This 
would seem to simplify things, and it does in some respects, but not in all. 
We will be concerned with how many structures (hands) can be built out 
of given building blocks (cards). 

A card C = C(n,p ) now has only its weight n and its picture p. For 
each n = 1,2,... there is a deck V n that contains d n cards, all of weight n. 
A hand is a multiset of cards. That is, we may reach into one of the decks 
V r and pull out of it some number of copies of a single card C(r, p'), then a 
number of copies of C(r. p"). and so forth, then from another deck we can 
take more cards, etc. 

No significance attaches to the sequence of cards in the hand. What 
matters is which cards have been selected and with which multiplicities. 
The weight of a hand is the sum of the weights of the cards in the hand, 
taking account of their multiplicities. As before, we let h(n, k ) be the 
number of hands of weight n that contain exactly /<:: cards, and we let 

H(x, y) = h(n, k)x n y k . (3.14.1) 

n,k 

Notice that the ‘n!’ is missing in the assumed form of the generating func- 
tion. Instead of the mixed egf-ops that was appropriate for labeled counting, 
a pure ops is the way to go for unlabeled counting. 

We need a generic name for the systems that we are constructing. We 
will call them prefabs (instead of exponential families, which applies in the 
labeled case), and will use letters like V to represent them. 

Thus a prefab V consists of a sequence of decks T) \ . V> 2 , . . . from which 
we can form hands, as described above. In V we let 'D(x) 21Z { d n } . 

The main problem is to find the functional relationship between 7i (x, y ) 
and D(x). so let’s do that now. We will use the Sorcerer’s Apprentice 
method once more. 

For the trickle, consider a prefab V that consists of just one nonempty 
deck, V r , and suppose that T) r contains only a single card. 

In this prefab, a hand if is a fairly simple-minded thing. It consists of 
some number, k say, of copies of the one and only card that there is, and 
its weight will be n = rk. Hence in this prefab the number h(n , k ) of hands 
of weight n that have exactly k cards is 1 if n = rk and is 0 else. Thus 

di(x, y ) = h(n , k)x n y k 

n,k 

= X> vy 

k> 0 

1 

1 — yx r ’ 


(3.14.2) 
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Next, just as in section 3.4, we define the merge operation. If V' and 
V" are prefabs whose picture sets are disjoint, then by their merger V = 
V © V" we mean the prefab whose deck T) n . for each n. is the union of the 
corresponding decks of V' and V" . If there were d' n , d", cards, respectively, 
in those two decks, then there are d n = d' n + d'l cards in V n . 


Fundamental lemma of unlabeled counting. Let TC (■ x,y ), Ti"{x,y) 
and TL(x,y) be the hand enumerators of prefabs V , V" and V = V © V" , 
respectively. Then TL = TCTi" . 


Proof. Consider a hand H G V, of weight n, and containing exactly k 
cards. Some k' of those cards come from V' . and their total weight is, say, 
n' . while the remaining k — k' cards come from V" , and their total weight 
must be n — v! . Thus 


h(n, k ) = h' {ri . k')h"(n — n' ,k — k'), 

k 1 ,n' 

but, by a strange coincidence, that is exactly the relationship which holds 
between the coefficients of the power series Tt, TC and TL" . | 

Armed with the fundamental lemma, we can now consider a slightly 
more complicated prefab V r , which still contains just one nonempty deck 
D r , but now that deck contains d r different cards. By induction on d r = 
1 , 2 ,..., we see at once that the hand enumerator of this prefab is 

n(x,y) = - 1 — ^ (3.14.3) 

(1 — yx r ) a " r 


Finally (dejd vu anybody?), in a general prefab V in which there are 
d n cards in deck V n , for each n = 1, 2, 3, . . ., we observe that V = ©^L 1 V n , 
where the V n are as defined in the previous paragraph. We obtain at once: 

Theorem 3.14.1. In a prefab V whose hand enumerator is 'H(x. y) we 
have 

OO 

n*,y) = n (1 _ ■ < 3 - 14 - 4 ) 


where d n is the number of cards in the nth deck (n > 1). 


This is the analogue of the exponential formula in the case where there 
are no labels. Like the exponential formula, this one too has an astounding 
number of elegant applications, and we will discuss a number of them in 
the sequel. Before we get to that, let’s convert (3.14.4) into a for mula from 


which we could actually compute the h’s from the cfs, using the L yD log ’ 
method of section 1.6. 
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If we take the logarithm of both sides of (3.14.4) 


log H(x,y) = J^log 


— ( (1 -yx s ) c 

J2 d s log-— 


s> 1 


(i -yx s ) 


s>l m> 1 


yin^sm 


m 


V di 

— J m m 

n,m> 1 


where dj is to be interpreted as 0 if its subscript is not a positive integer. 
Next we differentiate with respect to y and multiply by yH, getting 

y sn^yX = n(x y) £ lV » ds . 


Finally, we take [ x n y m ) of both sides, which yields 


mh(n,m) = h(n — rm',m — m')d r (n, m > 1; h(n, 0) = h n ,o)- 

r,m'> 1 

(3.14.5) 

This recurrence holds in any prefab, and permits the numerical computation 
of the hand counts from the deck counts. 

Often the 2-variable deck enumerators H(x,y) or {h(n, k)} Uj k > o give 
more detail than is necessary. If h n = h(n. k ) is the number of hands 
of weight n. however many cards they contain, and if 


H(x) °JZ { h n }*, 


then, since we obtain H( x) from 7 i(x, y) by formally replacing y by 1, the 
general counting theorem (3.14.4) becomes 


H{x) ] ! 


1 


11 (i - x r ) d - ' 

r= 1 v 7 

The recurrence (3.14.5) can be replaced by 

fT'klji ^ ^ Dm,hn— m (^ T lj ho 1), 


(3.14.6) 


(3.14.7) 


m> 1 


where D m = ^ r \ m rd r (m = 1,2,...). 
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Considerably more detailed information can be obtained with just a 
little more effort. Suppose we restrict the multiplicities with which the 
cards can be used in hands. For instance, suppose we decree that every 
card that appears in a hand must appear there with multiplicity that is 
divisible by 3, etc. Then what can be said about the number of hands? 

Let W be a fixed set of nonnegative integers, containing 0. For each 
n and k we let li(n. k: W) be the number of hands of weight n that have 
exactly k cards (counting multiplicities!), each appearing with a multiplicity 
that belongs to W. Let 

H(x,y-,W) =J2 h (n,k-,W)x n y k . 

n,k 

Finally, let 

w(t) = tk - (3.14.8) 

kew 

The generating functions are again multiplicative under merger of pre- 
fabs with disjoint picture sets. Consider a prefab with just 1 card of weight 
r, and no other decks. Then h(n, k\ W ) = 1 if k G W and n = kr, and is 0 
otherwise, and so 

u{x,y,w ) = Y xkr y k = w (y xT )- 

kew 

If there are d r cards in the rth deck, and no other cards, then H (x, y ; W) = 
w(yx r ) dr , and finally we obtain: 

Theorem 3.14.2. Let the prefab V contain decks of sizes d\ , d^. . . and 
let W be a set of nonnegative integers, 0 G W. Ifh(n , k; W ) is the number of 
hands of k cards of weight n, such that each card appears with a multiplicity 
that belongs to W, then 

?f(x,y;W ) = Y h ( n ikW)x n y k = JJ w(yx r ) d \ (3.14.9) 

n,k r> 1 


where w{t ) is given by (3.14.8). 


theorem 3.14.1 


in the case where 


Observe that the theorem reduces to 
W = Z + , the set of all nonnegative integers. 

A noteworthy special case is W = {0, 1}, which means that we can 
choose a card for our hand or not, but we can’t take more than one copy 
of it. In that case (3.14.9) gives 


'H(x,y;{0, 1}) = Y[{l + yx 


r\d r 


r> 1 


(3.14.10) 


H(x,-y-Z+Y 
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We proceed with several examples of the use of these formulas. 

3.15 The money changing problem 

Suppose that in the coinage of a certain country there are 5-cent coins, 
11-cent coins, and 37-cent coins. In how many ways can we make change 
for $17.19? 

In general terms, we are given M positive integers 
1 < ai < a2 < • • • < 8 m, 

and we ask the following question: for each positive integer n, in how many 
ways can we write 


n = x\a\ + X 2 d 2 + ■ ■ ■ + xmclm (Vi : Xi > 0), (3.15.1) 


where the x’s are integers? This problem is of great importance in a number 
of areas, both pure and applied, and it has a very beautiful theory, some of 
which we will give here. 

For given a \, . . . , clm we write S = S(ai , . . . , om) for the set of all n 
that can be written in the form (3.15.1). S is a semigroup of nonnegative 
integers. 

First let’s identify the prefab V in which everything will be happening. 
The decks are almost all empty. The only decks that are not empty are the 
M decks V ai , . . . , V aM . Each of these contains just a single card. Hence 
the deck enumerating sequence is 

j _ f 1 if n — ai, . . . , a,M 
n l0 else. 


In a sense, then, the problem is all over. If h(n, k ) denotes the number 
of ways of making change that use exactly k coins, i.e., the number of 
representations (3.15.1 ) in whic h = then according to the main 


counting theorem (eq. (3.14.4) 


we have 


H(x,y) = 


(1 — yx ai )( 1 — yx a 2 ) •••(! — yx aM )' 


(3.15.2) 


If hji, is the number of ways of representing n without regard to the number 
of coins, then from the cruder formula (3.14.6) 


H(x) = 


(1 -x ffll )(l 


1 

x“ 2 ) ■■■(! — x aM ) 


(3.15.3) 


Even though the generating functions are known, substantial questions 
remain. Here are a few of them. 
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How can we describe the set SI That is, which sums of money can be 
changed? Given 8-cent and 12-cent coins only, it wouldn’t be reasonable 
to expect to make change for 53 cents. In general, if the greatest common 
divisor of the set {ai, . . . ,0 m} is g > 1 , then only multiples of g can be 
represented. But suppose that g = 1, i.e., that the afs are relatively prime. 
Then which integers are representable? The central result of this subject is 
due to I. Schur. It states that S then contains all sufficiently large integers, 
i.e. there e xists an in teger N such that every integer n > N is representable 


in the form (3.15.1). 

The smallest integer N that has the property stated in the theorem will 
be called the conductor of the set S = {oi, . . . , om}, and will be denoted 
by the symbol re = re(«S). 

For instance, every integer > 8 can be represented as a nonnegative 
integer linear combination of 3 and 5, and 7 cannot be so represented, so 
«({3,5}) = 8. 

The problem of determining the conductor of a set S exactly seems to 
be of enormous difficulty. There are no general ‘formulas’ for the conductor 
if M > 3, and no good algorithms for calculating it if M > 4. The case 
M = 2 is already very pretty, and the answers are known, so here they are: 

Theorem 3.15.1. Let a and b be relatively prime positive integers. Then 

(a) every integer n > re = (a — 1)(6 — 1) is of the form n = xa + yb, 
x, y > 0, and 

(b) the integer re — 1 is not of that form, and 

(c) of the integers 0, 1, 2, . . . , re — 1, exactly half are representable and 
half are not. 


Proof. (Our proof follows [NW]’ Since gcd(a , b ) = 1, we can certainly write 
every integer m as xa + yb it x, y can have either sign. The representation 
is unique if we require that 0 < x < b. Then m G S if y > 0, and m ^ S if 
y < 0. The largest integer that is not representable is therefore obtained by 
choosing x — b — 1, y = —1. Hence re(«S) is one unit larger than (6— 1 )a — b, 
and parts (a) and (b) of the theorem are proved. 

To prove (c), let 0 < m < n(S), and again consider the unique way of 
writing m = xa + yb , with 0 < x < b. Then 

m' = k — 1 — m = (b — 1 — x)a + ( — 1 — y)b. 

Now 0 < b — 1 — x < b, so if y > 0 then m is representable and m! is not, 
while if y < 0 then ml is representable and m is not. Hence exactly half of 
the numbers 0, 1, . . . , re — 1 are representable. | 

Now we’re going to prove Schur’s theorem. The idea of the proof is 
that we will consider (witho ut ever w riting it down) the partial fraction 
expansion of the right side of (3.15.3) Among the multitude of terms that 


occur there we will identify one term whose power series coefficients grow 
more rapidly than any other, and this will give the desired result. 
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The generating function 7i(x) in 
poles all lie on the unit circle \x\ = 1. 
of unity. 

What are the multiplicities of these poles? The point x = 1 is a pole of 
multiplicity M, because the denominator of 7i(x) is divisible by (1 — x) M . 
Let (jo = e 27 ™ r / s be a primitive (i.e. , gcd(r, s) = 1) sth root of 1. What is 
the multiplicity with which this point x = oo occurs as a pole of 7d(x)? It 
is equal to the number of ad s that are divisible by s. Since the ad s are 
relatively prime, it cannot be that all of them are divisible by s. 

Therefore x = 1 is a pole of order M of 7i(x), and every other pole 
has multiplicity < M . 

Suppose w is a pole of order r. Then the portion of the partial fraction 
expansion of 7i that comes from u) is of the form 

£1 + ^ + .... 

(1 —x/uj) r (1 ~x/oo) r 1 

Now refer to the power series expansion (2.5.7), which we repeat here: 

1 

(1 - x) k+1 



(3.15.3)| is a rational function whose 
In fact, the Doles are at various roots 


If k = 1, the coefficients of this expansion are linear functions of n. If 
k = 2 they are quadratic functions of n. In general, the coefficients of x n 
are growing, as n — > oo, like n k /k\. 

The contribution of one fixed pole of order r to the coefficient sequence 
of 7i(x) therefore grows like cri r ~ 1 . There is one pole, at x — 1 , of order M. 
Its portion of the partial fraction expansion contributes ~ an A 1 ~ 1 to the nth 
coefficient of 7i(x). Since all other poles have strictly lower multiplicities, 
none of them can alter the asymptotic rate of growth that is contributed by 
the principal pole at x = 1 . Hence, for n — > oo we have h n ~ cn M 1 . That- 
certainly implies that for all large enough values of n we will have h n ^ 0, 
and that finishes the proof. However, as long as we’re here, why not find 
out the value of c also? 

The partial fraction expansion of 7i(x) is of the form 


H(x) = 


1 


(1 -x ai )(l 


X 


«2 


)•••(! — x aM ) 


(1 ~x) 


M 


+ 0((1 -x)~ M+1 ). 


To calculate c, multiply both sides by (1 — x) M and let x — ► 1. This gives 
c = l/(ai • • ■ au)- Thus we get a growth estimate along with the proof of 
the theorem. 
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Theorem 3.15.2 (Schur’s theorem). If h n denotes the number of rep- 
resentations of n as a nonnegative integer linear combination of ai, ... , om, 
these being a relatively prime set of positive integers, then 


h n ~ 


n 


M—l 


(M - l)!aia 2 • • • a M 



oo). 


(3.15.4) 


In particular, there exists an integer N such that every n > N is so repre- 
sentable in at least one way. | 


Example 1. 

Given two relatively prime integers a, b. Find an explicit formula for 
/(n), the number o f ways to change n cents using those coins. 


From 


(3.15.3) | we have 


^f(n)x n 


1 

(1 — x a )(l — x b ) ’ 


(3.15.5) 


so what remains is a partial fraction expansion. We find 

1 A , B , Dc 

(1 — x a )(l— X b ) (1 — x) 2 (1 — x) 1 —x/uj 1 —xjcj 

(3.15.6) 

As regards the constants, we already know that A — 1 / {oh), from 
(3.15.4). To find B , multiply (3.15.6) by (1 — x) 2 , differentiate, and let 
x = 1. This gives B = (o + b — 2)/ (2ab). To find C u , multiply by (1 — x/uj) 
and let x = oo. The result is that C w = l/(a(l — ix b )), and similarly for D^. 
Finally we take the coefficient of x n throughout (3.15.6) to get the formula 


f(n) 


n 

ab 


a + b 
2 ab 






C n' 


(3.15.7) 


If we examine the two sums that appear in (3.15.7) as functions of n , we 
see that each of them is a periodic function of n. The first, sum is periodic of 
period a and the second is periodic of period b. The sum of these two sums 
is therefore periodic of period ab. We have therefore found that the number 
of ways to change n cents into coins of a- and b-cent denominations is 

/(n) = ^ + 1 ^+per(n), (3.15.8) 

where per(n) is periodic of period ab, and is on the average 0. 

We might like to see this periodicity in action, so let’s take a — 3 and 
b — 5. A good way to compute the numbers f(n) is to use the recurrence 
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formula that is implicit in the generating function 


xD log method 


im plicit ] 
on |(3 15. 


5) 


(3.15.5) 


If we use the 


n f{n) = s Y /( n - 3 .?) 

j > i j > i 


we find the recurrence in the form 

n ~ 5j) (n > 1; /(0) = 1), (3.15.9) 


with the understanding that /(m) = 0 if m < 0. Table 3.1 shows n, /(n), 
and 15 (f(n) — (n/15) — (4/15)) (which is periodic of period 15, according 
to (3.15.8)). | 
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n 


3.16 Partitions of integers 

A partition of a positive integer n is a representation 

: ri + r 2 d b r k (n > r 2 > ■ ■ ■ > r k > 1). (3.16.1) 

The numbers r±, ... ,rk ate the parts of the partition. Hence (3.16.1) is a 
partition of n into k parts. 

There are 7 partitions of 5, namely 5=5, =4+1, =3+2, =3+1+1, 
=2+2+1, =2+l+l+l, =1+1+1+1+1. The number of partitions of n is de- 
noted by pin), and p{n , k) is the number of partitions of n into k parts. The 
investigation of the deeper properties of p{n) was one of the jewels of 20th 
century analysis, involving researches of Hardy and Ramanujan and further 
work by Rademacher, the result of which was an exact closed formula for 
p{n ) that was at the same tim e a co mplete asymptotic series. The whole 
story can be found in Andrews 

From our point of view, 
money-changing pro blem w 
able. Thus, theorem 


[An 


;he theory of partitions is the case of the 
lere coins of every positive integer size are avail- 
gives us immediately an opsgf of the partition 
function in the form of the reciprocal of an infinite product, 

1 


3.14.1 


Y P(n, k)x n y k = 


n,k>0 


(1 — yx)( 1 — yx 2 ){ 1 — yx 3 ){ 1 — yx 4 ) ■ ■ ■ 

(p(0,k) = S 0 y ■.). 


(3.16.2) 
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With y = 1 we find 


)x 


n> 0 


(1 — x)(l — x 2 )(l — x 3 )(l — x 4 ) ■ ■ ■ 


(;p(0) = l) (3.16.3) 


as the generating function for {p(n)} itself. 

At the other extreme, we can think about partitions with constrained 
parts and constrained multiplicities of parts. Let two sets W, of nonnegative 
integers, and R, of positive integers, be given, with 0 G W. Let p(n , k; W , R ) 
be the number of partitions of n into k parts such that all of th e parts lie 


in R. and all of their multiplicities lie in W. Then from (3.14.9) 


5 ^ / p(n 1 k;W,R)x n y k = JJ 

n,k rER 


Y, V k x kr 


(3.16.4) 


Kkew 


From this generating function we can prove many theorems about parti- 
tions. 

Example 1. 

Let W = {0, 1}, R = {1,2,.. .}. Then p(n, /c; W, R) is the number of 
partitions of n into k distinct parts, and we have 

5>(n, k; {0, 1}, Z +)x n y k = J]! 1 + V^)- (3-16.5) 

n,k r>l 

If we let y = 1 we obtain 


Xk( n ;{ o, i},z + )x" 


na + v) 

r>l 


n 

r> 1 


1 -x 2r 


1 — x r 


(1 — x 2 )(l — x 4 ) ■ ■ ■ 

(1 — x)(l — x 2 )(l — x 3 )(l — x 4 ) ■ ■ ■ 

1 

(1 — x)(l — x 3 )(l — x 5 )(l — X 7 ) 


The last member, however, generates the partitions of n into odd parts, 
and we have a generating function proof of: 

Theorem 3.16.1. For each n — 1, 2, 3 , . . the number of partitions of n 
into odd parts is equal to the number of partitions of n into distinct parts. 


For instance, the partitions of 5 into odd parts are 5, 3+1+1, and 
l+l+l+l+l, while its partitions into distinct parts are 5, 4+1, and 3+2. 
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Theorem 3.16.1 was discovered by Euler. A great many proofs of it have 
been given. Some of the most interesting proofs are Injective] that is, they 
give explicit constructions that match each partition into odd parts with a 
partition into distinct parts. ( 


Example 2. 

Now let W = {0, 1, . . . , q} and R = Z + . The right side of 
becomes 

/ 1 _ + r (q+ 1 ) 

n ( 1 + f +...+*■*-)= n 

r>l r>l 


(3.16.4) 


1 -t r 


Each factor in the numerator of this product cancels one in the denominator, 
leaving in the denominator only those factors in which r is not divisible by 
q + 1. This proves the following result, which reduces to 
when q = 1 . 


theorem 3.16.1 


Theorem 3.16.2. Fix q > 1. For each n > 1, the number of partitions 
of n into parts that are not divisible by q + 1 is equal to the number of 
partitions of n in which no part appears more than q times. | 


3.17 Rooted trees and forests 

A rooted tree is a tree whose vertices are unlabeled, except that one 
of them is distinguished as ‘the root.’ 

In section 3.12 we counted labeled trees, using the exponential formula. 
Here we will count unlabeled, rooted trees. On each card in a deck T> n there 
is now the integer n and a picture of a rooted tree of n vertices. The deck 
T > 4 is shown in Fig. 3.5. 



Fig. 3.5: The rooted trees of 4 vertices 


A hand of weight n and k cards is, in this case, a rooted forest of n 
vertices and k connected components (rooted trees). If h(n. k ) is the number 
of these and if h (n) = y\ h(n, k ) is the number of all rooted forests of n 
vertices, then by (3.14.6) 


y] h(n)x n 


n 

n> 1 


1 

(i - x n y («) ’ 


(3.17.1) 
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trees of n vertices. 


[see 3.12.2), there is a simple 


where t(n) = h(n, 1) is the number of rooted 
Next, just as we found in the labeled case 
relationship between the number of rooted forests of n vertices and of rooted 
trees of n + 1 vertices: they are equal. Just add a new vertex r to the forest, 
call it the new root, connect it to all of the former roots of the trees in the 
forest, and there is the rooted tree that cor responds to the given forest. 
Hence h(n) = t(n + 1) (n > 0). Then 


(3.17.1) 


takes the form 


^f(n + l)x n 


n 

n> 1 


1 

(i - x n )d«) ' 


(3.17.2) 


This equation in fact determines all of the numbers {f(n)}. It is, however, 
a fairly formidable equation, and we should not expect simple formulas for 
these numbers. 


3.18 Historical notes 


The exponential formula first appeared in the thesis of Riddell [RU] 
in the form of counting connected labeled graphs from a knowledge of the 
number of all labeled graphs. Since then the idea has been generalized and 
extend ed by several researchers. 


In 


[BG] 


[FS] significant extensions 


and at about the same time in 
of the idea were made to very general labeled and unlabeled applications, 
by Bender and Goldman and by Foata and Schiitzenberger. The former 
introduced ‘prefabs’ and the latter used the ‘ compose pa rtitionneV Fu rther 


developments of the method can be found in S tanley ( [St 1] , [St2][ who 


worked with a partition-based approach, in Joyal [Jo 
met hods in his theory of ‘species,’ in Beissingerf Bci 


[GaJ 


who used functorial 
and in Garsia and 


Joni 

The approach taken in this book is most closely akin to the compose 
partitionnel. The suggestion to cast the discussion in terms of cards, decks, 
and hands was made to me in private conversation by Adriano Garsia, when 
I showed him a set of lecture notes of mine that were based on the graph- 
theoretical point of view. I think that his suggestion affords maximum 
clarity of the ideas along with maximum generality of applications. 
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Exercises 


1. Give an explicit 1-1 correspondence between partitions of n into distinct 
parts and partitions of n into odd parts. 


2. Fix integers n, k. Let f(n. k ) be the number of permutations of n letters 
whose cycle lengths are all divisible by k. Find a simple, explicit egf for 
{f(n,k)} n > o- Find a s imple, explicit formula for f(n, k). (Hint: You might 


need the discussion at 


the end of section 3.4. 


) 


3. Find the egf for the partitions of the set [n], all of whose classes have a 
prime number of elements. 


4. In a group T, the order of an element g is the least positive integer p 
such that g p = lp. 

(a) In the group of all permutations of n letters, express the order of 
a permutation a in terms of the lengths of its cycles. 

(b) Let g(n, k ) be the number of permutations of n letters whose or- 
der is k. Express g(n,k ) in terms of the number g(n,m) of n- 
permutations whose cycle lengths all divide m. 

5. Let T n be the number of involutions of n letters. 

(a) Find a recurrence formula that is satisfied by these numbers. 

(b) Compute Ti, . . . , Tq. 

(c) Give a combinatorial and constructive interpretation of the re- 
currence. That is, after having derived it from the generating 
function, re-derive it without the generating function. 


6. Find, in simple form, the egf of the sequence of numbers of permutations 
of n letters that have no cycles of lengths < 3. Your answer should not 
contain any infinite series. 


7. Find the generating function for labeled graphs with all vertices of 
degrees 1 or 2, and an odd number of connected components. Find a 
recurrence formula for these numbers, calculate the first few, and draw the 
graphs involved. 


From 


(3.5.2) find a three term recurrence relation that is satisfied by 


the Stirling numbers of the first kind. Give a direct combinatorial proof of 
this recurrence relation. That is, reprove it, without using any generating 
functions. 


9. As in 


section 3.7 


find the egf of the numbers {(j(n)}\f of permutations 
of n letters that have both of the following two properties: (a) they have 
an odd number of cycles and (b) the lengths of all of their cycles are even. 
Find a simple, explicit formula for these numbers. 

10. Find an explicit formula for { " } , the Stirlin g numbe r of the second 
kind by expanding the kth power that appears in (3.6.2) by the binomial 
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theorem. Your formula should be in the form of a single finite sum. 


11. Let S, T be fixed sets of positive integers. Let /(n; S', T) be the number 
of partitions of [n] whose class sizes all lie in S and whose number of classes 
lies in T. Show that {/(n; S', T)} n >o has the egf eT(es(x)) : where es(x ) = 
T,ses xS /s'~ 


12. Fix k > 0. Let f(n, k) be the number of permutations of n letters 
whose longest cycle has length k. Find the egf of {/(n, &)} n > 0 , for k fixed. 


13. If T{x ) and Q(x ) denot e, respec tively, the egf’s of involutions, in (3.8.3) 
and of 2-regular graphs, in (3.9.1) then observe that 


T(x)G(x) 2 = 


1 

1 — x 


(a) Write out the identity between the sequences {g(n)}, {t n } that is 
implied by the above generating function relation. 

(b) Show that for each fixed n > 1 there are exactly the same numbers 
of 

(i) permutations of n letters and of 

(ii) triples (r, G i, G 2 ), where r is an involution of a set R, G\ 
is a 2-regular graph on a vertex set S : G2 is a 2-regular 
graph on a vertex set T, and R. S,T partition [n] . 

(c) Find, explicitly, a 1-1 correspondence such as is described in part 
(b) above. 

14. Let T be an exponential family with associated polynomials {( f> n (x )} 
of binomial type, and with deck enumerator 'D(x). 

(a) If D y denotes the differential operator d/dy , then show that 
V { ~ 1 ' ) (D y )(j) n (y) = (n > 0) 


by directly applying the operator to the egf of the polynomial 
sequence (here 1 F -1 ) denotes the inverse function in the sense of 
functional composition). 

(b) In the case of the exponential family of permutations by cycles, 
find the associated polynomials of binomial type, and verify the 
identity proved in part (a) by direct computation with those poly- 
nomials. 


15. In an exponential family T . let h[n) be the number of hands of weight 
n whose cards have all different weights. 


a 


Show that 


= 


n> 0 


n\ 


n 

fc=l 




106 


3 Cards, Decks, and Hands: The Exponential Formula 


(b) Let p n be the probability that a permutation of n letters has cycles 
whose lengths are all different. Then 

n{i+x}- 


(c) If p(x) denotes the generating function in part (b) above, deter- 
mine the growth of p(x) as x — » 1“ . Do this by inserting additional 
factors of e~ x in the product. 

16. Let numbers {c n } be defined by 


x 


i+E>-d 


n> 1 


Show that each c n is an integer multiple of n, and in fact is a multiple of 
n(n — 1) if and only if n — 1 divides (n — 2)!. 


17. Here we want to show that the Stirling numbers of the first and second 
kinds a re inverse to each other, in a certain sense. In the generating function 
(3.5.2) f or the fo rmer, replace x by 1 / x and compare with the generating 
f un ction (1.6.5) for the latter. Multiply the functions together so that the 
hard part cancels out. Read off the coefficient of x n in what remains, and 
state it as an assertion that a certain pair of matrices, each involving Stirling 
numbers, are inverses of each other. 


18. Let a n be the number of unlabeled graphs of n vertices each of whose 
connected components is a path or a cycle. Let F(x) be the opsgf of the 
sequence {a n }. Find F(x) and express it in terms of Euler’s opsgf for the 
sequence {pin)} of the numbers of partitions of integers n. 

19. Let a n be the number of unlabeled rooted trees of n vertices in which 
the degree of the root is 2. That is, there are exactly 2 edges incident at 
the root. Let T{x) be the opsgf of the sequence {t n } that counts all rooted 
trees of n vertices. Show that 


J ^a n x n 1 = Vt(j) 2 + T(x 2 ) 


20. Find the largest integer that is not of the form 6x + lOy + 15 z where 
x , y, z are nonnegative integers. Prove that your answer is correct, i.e. , that 
your integer is not so representable, and that every integer larger than it is 
so representable. 


21. In a country that has 1-cent, 2-cent, and 3-cent, coins only, the number 
of ways of changing n cents is exactly the integer nearest to (n + 3) 2 /12. 


22. This exercise develops a con siderable sha rpening of the exponential 
formula, that will be used again in 


section 4.7. 
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(a) In an exponential family J~ . the number of hands of weight n that 
contain exactly a\ cards of weight 1 and a ,2 cards of weight 2 and 
as of weight 3 and . . where a\ + 2o2 + • ■ ■ = n, is the coefficient 
of (f n a;“ 1 X 2 2 ■ ■ -)/n\ in the expansion of 

ex Pt2^— — )■ 

i> i 

(b) Let f(n,r, s ) be the number of partitions of the set [n] that have 
exactly r classes of size 1 and exactly s classes of size 2 (however 
many classes of other sizes they may have). Then 

f(n,r,s)x r y st - - = exp (xt + + e* - 1 - t - *—). 

n,r,s ~ ” 
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Chapter 4 

Applications of generating functions 

4.1 Generating functions find averages, etc. 

Power series generating functions are exceptionally well adapted to 
finding means, standard deviations, and other moments of distributions, 
with minimum work. Suppose f(n) is the number of objects, in a certain 
set S of N objects, that have exactly n properties, for each n = 0.1.2,..., 
with f(n ) = N. What is the average number of properties that an 

object in S has? Evidently it is 

^=JJ^ n f{ n )- (4-1.1) 

n 

Suppose we happen to be fortunate enough to be in possession of the 
opsgf of the sequence {/(n)}, say F(x ) PT {/(n)}. Is there some convenient- 
way to express the mean // of (4.1.1) in terms of F‘l But of course. Clearly, 
//, = F' (1)/ F(l). So averages can be computed directly from generating 
functions. 

Let’s go to the next moment, the standard deviation cr, of the distri- 
bution. This is defined as follows: 

-h) 2 , (4-1.2) 

ujES 

where u) represents an object in the set S, and n ( uj ) is the number of prop- 
erties that uo has. a 2 , which is known as the variance of the distribution, 
is therefore the mean square of the difference between the number of prop- 
erties that each object has and the mean number of properties //,. 

Every one of the /(n) objects u that has exactly n properties will 
contribute (n — fi) 2 to the sum in (4.1.2), and therefore 

a 1 = ~ A0 2 /(™) 

n 

= Jr “ 2 ^ n + A^)/^) 

n 

= -^{(rr-D) 2 - M*D) +h 2 }F(x)\ x=1 (4L3) 

= (F"(l) + (1 - 2/j,)F'(l) + PF(1))/F(1) 

= F"(l)/F(l) + F'(l)/F(l) - (F'(l)/F(l)) 2 
— {(log F)' + (log F)"}a. =1 . 
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So the standard deviation can also be calculated in terms of the values of 
F and its first two derivatives at x = 1. 

Let’s work this out in exponential families. In an exponential family 
T . what is the average number, //(n), of cards in a hand of weight n? 

If h(n, k ) is the number of hands of weight n that have k cards, then 
the average is 

M n ) = kh(n, k ). (4.1.4) 

Now if we begin with the exponential formula 

J2l<n,k)^-y k = e»' D < I > 
n\ 

n,k 

the thing to do is to apply the operator d/dy and then set y = 1. The 
result is that 

Tl 

^ ^ kh(n, k ) = V(x)e D ^ x ' > = T>(x)7i(x). (4.1.5) 

n ' k 

Theorem 4.1.1. In an exponential family T , the average number of cards 
in hands of weight n is 


p(n) 


h{n)x n 


V(x)H{x) 



r ). 


(4.1.6) 


Example 1. Cycles of permutations 

The averaging relations (4.1.6) are particularly happy if h(n ) = n!, as 
in the family of all permutations. There, (4.1.6) becomes 


h(n) = (.r - l)!(n - r)\ 


1 1 1 

— 1 + - + -H H — • 

2 3 n 

Consequently, the average number of cycles in a permutation of n letters is 
the harmonic number H n . 

W hat is the standard deviation? The function F{x) that appears in 


(4.1.3), in the case of permutations, is, for n fixed, 

F(x) = yy h(n, k)x k = x(x + l)(x + 2) ■ ■ ■ (x + n — 1), 


110 


4 Applications of generating functions 


(3.5.2) After taking logarithms and differentiating, following (4.1.3) 


by 

we'fmd F{l) = n\, (logF)'(l) = H n , and 

(log F)"(l) = -1 - 1/4 - 1/9 - 1/16 1/n 2 . 


If we substitute this into 


(4.1.3) 


we find that the variance of the distribution 


of cycles over permutations of n letters is 

cr 2 = H n — 1 — 1/4 — 1/9 — •• 
= logn + 7 — 7t 2 /6 + o(1). 


1 /n 2 


where 7 is Euler’s constant. 

Hence the average number of cycles is ~ log n with a standard deviation 
a ~ ydog n. | 


4.2 A generatingfunctionological view of the sieve method 

The sieve method* is one of the most powerful general tools in com- 
binatorics. It is explained in most texts in discrete mathematics, however 
it most often appears as a sequence of manipulations of alternating sums 
of binomial coefficients. Here we will emphasize the fact that generating 
functions can greatly simplify the lives of users of the method. 

We are given a finite set 0 of objects and a set P of properties that 
the objects may or may not possess.** In this context, we want to answer 
questions of the following kind: how many objects have no properties at 
all? how many have exactly r properties? what is the average number of 
properties that objects have? etc., etc. 

The characteristic flavor of problems that the sieve method can handle 
is that, although it is hard to see how many objects have exactly r proper- 
ties, for instance, it is relatively easy to see how many objects have at least 
a certain set of properties and maybe more. 

What the method does is to convert the ‘at least’ information into the 
‘exactly’ information. 

To see how this works, if S C P is a set of properties, let iV(I) S ) 
be the number of objects that have at least the properties in S. That is, 
7 V(D .S') is the number of objects whose set of properties contains S. 

For fixed r > 0, consider the sum 

n v = J2 n o s )• ( 4 . 2 . 1 ) 

\S\=r 

* A.k.a. ‘the principle of inclusion-exclusion,’ and often abbreviated as 
‘p.i.e.’ 

** Strictly speaking, a property is just a subset of the objects, but in 
practice we will usually have simple verbal descriptions of the properties. 
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Introduce the symbol P(u) for the set of properties that uj has. Then we 
can write N r as follows: 


N r = ^ N(D S ) 

\S\=r 

= E E 1 


\S\=r 

1 1 SCP(uj) 


E E i 

I I S | =r 

SCP(uj) 


E 


l-PMI 

r 


(4.2.2) 


Therefore evert/ object that has exactly t properties contributes (*) to 
IV r . If there are e* objects that have exactly t properties, then (4.2.2) 
simplifies to 

N r = I2( t ) e * ( r = 0,1,2,...)- (4-2.3) 

t>o 

Recall the philosophy of the method: t he N r ’s are easier to calculate 
than the e r ’s because they can be found from (4.2.1) However, the e r ’s are 
what we want. Therefore it is desirable to be able to solve the equations 

(4.2.3) for the e’s in terms of the N’s. But how can we do that? After all, 

(4.2.3) is a set of simultaneous equations. 

At first glance that might seem to be a tall order, but with a friendly 
generating function at your side, it’s easy. Let N(x) and E(x) denote* the 
opsgf’s of the sequences { A? } , {e r }, respectively. What relation between 
the two generating functions is implied by the equations (4.2.3)? 

Multiply (4.2.3) by x r and sum on r. We then get 




r t 


e t x 



(4.2.4) 


* The letters ‘N’ and ‘E’ are intended to suggest the N r 's and the word 
Exactly. ’ 
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In the language of generating functions, the set of equations (4.2.3) 


boils down to the fact that N(x) = E{x + 1). Now the problem of solving 
for the e’s in terms of the TV’s is a triviality, and the solution is obviously 


E(x) = N(x — 1) 


(4.2.5) 


This is the sieve method. The act of replacing the variable x by x — 1 in 
the generating function N(x) replaces the unfiltered data {N r } by the sieved 
quantities { e r }. 

If the IV’ s are known, then in principle we can read off the e’s as the 
coefficients of N(x — 1). 

For example, eo is the number of objects that have no properties at 
all. By (4.2.5), 

e 0 = E{ 0) = N(- 1) = ^(-1)^. (4.2.6) 

t 

It’s easy to find explicit- formulas for all of the e/s by looking at the coef- 
ficient of on both sides of (4.2.5). The result is 


< y = E(- 1 >‘" 3 '(*) JV <- ( 4 - 2 - 7 ) 

But (4.2.5) says it all, in a much cleaner fashion. 

We will now summarize the sieve method, and then give a number of 
examples of its use. 


The Sieve Method 


(A) ( Find O and P ) Given an enumeration problem, find a set of objects 
and properties such that the problem would be solved if we knew 
the number of objects with each number of properties. 

(B) ( Find the unfiltered counts N(D S)) For each set S of properties, 
find IV ( D S), the number of objects whose set of properties contains 
S. 


(C) 

(D) 


[Find the coefficients N r ) For each r > 0, calculate t he Ay by sum- 


ming the NfD S ) over all sets S of r properties, as in (4.2.1) 

( The answer is here.) The numbers e r are the coefficients of the 
powers of x in the polynomial N(x — 1). | 


Before we get to some examples, we would like to poin t out that the 
number N\ has a special role to play. According to (4.2.3). N\ = s f2 t tet. 
That, however, is what you would want to know if you were trying to 
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calculate the average number of properties that objects have. Hence it is 
good to remember that when using the sieve method on a set of N objects, 
the average number of properties that an object has is Ni/N. 


Example 1. The fixed points of permutations. 

Of the n\ permutations of n letters, how many have exactly r fixed 
poi nts? 


and 


Step (A) of the sieve method asks us to say what the set of objects is 
what the set of properties is. It is almost always worthwhile to be quite 


explicit about these. In the case at hand, the set 0 of objects is the set of 
all permutations of n letters. There are n properties: for each i = 1, . . . , n, 
a permutation r has property i if i is a fixed point of r, i.e., if r(i) = i. 

With those definitions of 0 and P , it is indeed true that we would like 
to know the numbers of objects that have exactly r properties, for each r. 


In |step (B)| we must find the N( D S ). Hence let A be a set of properties. 
Then S C [n] is a set of letters, and we want to know the number of 
permutations of n letters that leave at least the letters in S fixed. 

If a permutation leaves the letters in S fixed, then it can act freely 
on only the remaining n — |Sj letters, and so there are ( n — |<Sj)! such 
permutations. Hence 

N(DS) = (n-\S\)l 


For 


step (C) we calculate the N r 's. But, for each r = 0, . . . , n, 


X r =Yi N QS)= £>-|S|)! 

|S|=r \S\=r 


n 


(n — r)! = 


n\ 

p\ 


In step (D) we’re ready for the answers. It will save some writing if we 
introduce the abbreviation exp| a for the truncated exponential series 


ex P | Q (x) = 


0 <r<a 


X 


(4.2.8) 


Now we form the opsgf N(x) from the N r 's that we just found: 

n , n r 

at/ \ r i 

r= 0 r = 0 

Then et is the coefficient of x l in N(x — 1), i.e., 


t i y x ~ ^) 7 


E(x) = ^ e t x t = n\ ^ 


r = 0 


rp | 


= n!exp| n (x - 1). 


(4.2.9) 


As an extra dividend, the average number of fixed points that permu- 
tations of n letters have is 


Ni _ n[ 
~N _ n! 
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On the average, a permutation has 1 fixed point. 

The number of permutations that have no fixed points at all is 


e 0 = F(0) = N(—l) = n!exp jn (-l) ~ 


(4.2.10) 


Finally. if we really want a formula for the e^’s, it’s quite easy to find 


from (4.2.9) that 



(4.2.11) 


Example 2. The number of A-cycles in permutations. 

Fix positive integers n, A, and r > 0. How many permutations of n 
letters have exactly r cycles of length A? 

Whatever the answer is, it should at least have the good manners to 
reduce to the answer of the previous example when k = 1, since a fixed 
point is a cycle of length 1. 

What are the objects and the properties? Evidently Q is the set of all 
permutations of n letters. Further, the set P of properties is the set of all 
possible A:- cycles chosen from n letters. How many such A’-cycles are there? 
The k letters can be chosen in (") ways, and they can be arranged around 
a cycle in ( k — 1)! ways, so we are facing a list of (^)(A — 1)! properties. 

Choose a set S of A-cycles from P. How many permutations have at 
least the set S of properties? None at all, unless the sets of letters in those 
cycles are pairwise disjoint. If the sets are pairwise disjoint, then there are 
N(D S ) = (n — k\S\)\ permutations that have at least all of those A-cycles. 

Next we calculate N r . the sum of N(D S ) over all sets of r properties. 
The terms in this sum are either 0 or (n — kr)\. So we really need to know 
only how many of them are not 0, that is, in how many ways we can choose 
a set of r A-cycles from n letters in such a way that the cycles operate on 
disjoint sets of letters. 

The letters for the first cycle can be chosen in (”') ways, and they can 
be ordered around the cycle in (A’ — 1)! ways. The letters for the second cycle 
can then be chosen in (" j/') ways, and ordered in (A — 1)! ways, etc. Finally, 
since the sequence in which the cycles are constructed is of no significance, 
we divide by r!. Hence 


(n — kr)\ n!(A— l)! r 
r! (A!) r (n — kr)\ 


(4.2.12) 


(0 < r < n/k). 


4.2 A generatingfunctionological view of the sieve method 
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We can get a little piece of the solution right here, with no more 
work: the average number of A:-cycles that permutations of n letters have 
is Ni/n\ = 1/k. 

The opsgf of {N r } is 


N(x ) = n! ^ 


x 


0<r<n/k 


k r r\ 


(4.2.13) 


= 7)1 


X 

^ ! exp| (n/fc) (-). 


Finally, in the sieving step, we convert this to exact information by 
replacing x by x — 1, to obtain 


E(x) = n\ ex P | (n/fc) 


x — 1 


(4.2.14) 


Example 3. Stirling numbers of the second kind. 


The Stirling numbers j™}, which we studied in section 1.6, are the 
numbers of partitions of a set of n elements into k classes. We can find out 
about them with the sieve method if we can invent a suitable collection of 
objects and properties. For the set 0 of objects we take the collection of all 
k n ways of arranging n labeled balls in k labeled boxes. Further, such an 
arrangement will have property Pi if box i is empty (i = 1 , ,k). Then 
/c!{^} is the number of objects that have exactly no properties. 

Let S be some set of properties. How many arrangements of balls in 
boxes have at least the set S of properties? If N(D S ) is that number, then 
N(0 S) counts the arrangements of n labeled balls into just k — |Sj labeled 
boxes, because all of the boxes that are labeled by S must be empty. 

There are obviously ( k — .Sj ) n such arrangements. Hence 


N(D S ) = 


(A;-|S|) n if |Sj < k, 


0, else. 

If we now sum over all sets S of r properties, we obtain for r < k, 

Nr = ( k \k-r)\ 


whose opsgf is 

N(x)= ( k \k-r) n x r . 

0<r<k ^ r ' 

We can now invoke the sieve to find that the number of arrangements 
that have exactly t empty cells is the coefficient of x l in N{x — 1). On the 
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other hand, the number of arrangements that have exactly t empty cells is 
clearly 



k\ f n j 
t\ \/c — t J 


The result is the identity 


0 <r<k 


E ()(k-rnx-iy = k\ Y, 


0 <t<k 


n i r 

k — 1 1 t\ 


(4.2.15) 


If we put x = 0, we find the explicit formula (1.6.7) again. 
If, on the other hand, we compare (4.2.15) 


wi 


h the rule 


(2.3.3) 


for 


finding the coefficients of the product of two egf’s, we discover the following 
remarkable identity: 


E 

l<k<n 




r>l 


(4.2.16) 


This shows that e y times the infinite series is a polynomial! The special 


case y = 1 has been previously noted in (1.6.10) 


Example 4. Rooks on chessboards 

For n fixed, a chessboard C is a subset of [n] X [n]. We are given C, 
and we define a sequence {ry ;: } as follows: r k is the number of ways we can 
place k nonattacking (i.e., no two in the same row or column) rooks on C . 

Next, let a be a permutation of n letters. For each j we let e :j denote the 
number of permutations that ‘meet the chessboard C in exactly j squares,’ 
i.e., if the event ( i , a(i)) G C occurs for exactly j values of f, 1 < i < n. 

The question is, how can we find the e^’s in terms of the ry’s? 

Let the objects Q be the n! permutations of [n]. There will be a 
property P(s ) corresponding to each square s G C. A permutation a has 
property P(s ) if a meets the mini-chessboard that consists of the single cell 
s. 

Let S be a set of properties, i.e., of cells in C, and consider the sum 
Ay. = ^| 5 | =/i . A r (I) S). Each arrangement of k nonattacking rooks on C 
contributes ( n — k)\ to this sum. Indeed, when the set S corresponds to the 
cells on which those rooks can be placed, then we are looking at k of the n 
values of a permutation that hits C in at least k squares. The permutation 
can be completed, in the remaining n — k rows, in (n — k)\ ways. 

Hence — k)\. for each /c, 0 < k < n. Therefore 


N(x ) = ^^(n — k)\r k x k , 

k 

and immediately we find that the number of n-permutations that hit C in 
exactly j cells is 

[P] y^(n — k)\r k (x — l) k . 

k 


(4.2.17) 


4.3 The ‘Snake Oil’ method for easier combinatorial identities 
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Example 5. A problem on subsets. 

This example is more cute than profound, but we will at least finish 
with a combinatorial proof of an interesting identity, as well as illustrating 
the generating function aspect of the sieve method. 

For a fixed positive n, take as our set 0 of objects the ( 2 ; ('j ways 
of choosing an n-subset of [2 n\. For the set P of properties we take the 
following list of n (not 2 n) properties: an n-subset Q has property i if 
i ^ Q, for each i = 1,2, ... ,n (note that we are working with only the first 
half of the possible elements of S ). 

If S is a set of properties (i.e., is a set of letters chosen from [n]), then 
the number of ‘objects’ Q that have at least that set of properties (i.e., are 
missing at least all of the i G S) is clearly 


N(D S ) 


2 n — |Sj 


n 


Hence 



If we substitute these N's into the sieve 


(4.2.5) 


we find that 




2 n — r 
n 


(t-iy. 


(4.2.18) 


This formula tells us the number e ? of objects that have exactly j properties, 
for each j. 

But we didn’t need to be told that! 

An object that has exactly j of these properties is a subset Q of [2n] 
that is missing exactly j of the elements 1, 2, . . . , n. Obviously there are 
just (") such subsets Q, because we can choose the j elements that they 
are missing in (") ways, and we can then choose the other j elements t-hat- 
are needed to fill the subset from n + 1, . . . , 2n in (") ways also. 

Thus, with no assistance from the sieve method, we already knew that 
ej = (") , for all j. Hence, according to (4.2.18), it must be true that 

We therefore have an odd kind of a combinatorial proof of the identity 
(4.2.19). The reader should suspect that something of this sort is going on 
whenever an identity involves an expansion around the origin on one side, 
and an expansion around t = 1 on the other side. 
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4.3 The ‘Snake Oil’ method for easier combinatorial identities 

Combinatorial mathematics is full of dazzling identities. Legions of 
them involving binomial coefficients alone fill text- and reference books 
(see below for some references). It is a fine skill for a working discrete 
mathematician to have if he/she is able to evaluate or simplify complicated 
looking sums that involve combinatorial numbers, because they have a way 
of turning up in connection with problems in graphs, algorithms, enumer- 
ation, etc. (they’re fun, too!). 

In the past, one had to have built up a certain arsenal of special devices, 
the more the better, in order to be able to trot out the correct one for 
the correct occasion. Recently, however, a good deal of quite dramatic 
systematization has taken place, and there are unified methods for handling 
vast sub-legions of the legions referred to above. 

In this section we are going to do two things. First we will give a single 
method (the Snake Oil* method) that uses generating functions to deal 
with the evaluation of combinatorial sums. That one method is capable 
of handling a great variety of sums involving binomial coefficients, but 
there’s nothing special about binomial coefficients in this respect. The 
method also works beautifully, within its limitations, on sums involving 
other combinatorial numbers. The philosophy is roughly this: don’t try 
to evaluate the sum that you’re looking at. Instead, find the generating 
function for the whole parameterized family of them, then read off the 
coefficients. 

Second, we will confess that Snake Oil doesn’t cure them all. Some 
combinatorial sums are really hard. Many of the very hardest binomial 
coefficient sums can now be proved by computers using the method of 
rational functions, which we will discuss next. Not only that, but use 
of the computer has resulted in some new proofs of classical identities. 
The hallmarks of these proofs are that (a) they are very short compared 
to the previously known proofs, (b) they seem extremely unmotivated to 
the reader, but (c) nothing is left out, and they really are proofs. The 
computerized proof techniques rely on a very simple-looking observation, 
which we will describe and illustrate. 

Therefore, in this section you can expect to see one unified method 
that works on a lot of relatively easy sums, and one other unified method 
that works on many more kinds of binomial coefficient sums, including some 
fiendishly difficult ones. 

First let’s talk about the Snake Oil method. 

The basic idea is what I might call the external approach to identities 


* The Random House Dictionary of the English Language defines ‘snake 
oil’ as a purported cure for everything, and gives the example The governor 
promised to lower taxes, but it was the same old snake oil. The date of the 
expression is given as ‘1925-30, Amer.’ 
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rather than the usual internal method. 

To explain the difference between these two points of view, suppose we 
want to prove some identity that involves binomial coefficients. Typically 
such a thing would assert that some fairly intimidating-looking sum is in 
fact equal to such-and-such a simple function of n. 

One approach that is no w custo mary, thanks to the skillful exposition 
an d deft h andling by Knuth in 


Kn 


and by Graham, Knuth and Patashnik 
in|[GKP], consists primarily of looking inside the summation sign (‘inter- 
nally j . and using binomial coefficient identities or other manipulations of 
indices inside the summations to bring the sum to manageable form. 

The method that we are about to discuss is complementary to the in- 
ternal approach. In the external, or generatingfunctionological, approach 
that we are selling here, one begins by giving a quick glance at the expres- 
sion that is inside the summation sign, just long enough to spot the ‘free 
variables,’ i.e., what it is that the sum depends on after the dummy vari- 
ables have been summed over. Suppose that such a free variable is called 
n. 

Then instead of trying to grapple with the sum, just sweep it all under 
the rug, as follows: 


The Snake Oil Method for Doing Combinatorial Sums 

(a) Identify the free variable, say n, that the sum depends on. Give a 
name to the sum that you are working on; call it /(n). 

(b) Let F(x ) be the opsgf whose [x n ] is f(n), the sum that you’d love 
to evaluate. 

(c) Multiply the sum by x n , and sum on n. Your generating func- 
tion is now expressed as a double sum over n, and over whatever 
variable was first used as a dummy summation variable. 

(d) Interchange the order of the two summations that you are now 
looking at, and perform the inner one in simple closed form. For 
this purpose it will be helpful to have a catalogue of series whose 
sums are known, such as the list in section 2.5 of this book. 

(e) Try to identify the coefficients of the generating function of the 
answer, because those coefficients are what you want to find. 


If that seems complicated, just wait till you see the next seven exam- 
ples. By then it will seem quite routine. 

The success of the method depends on favorable outcomes of steps 
(d) and (e). What is surprising is the high success rate. It also has the 
‘advantage’ of requiring hardly any thought at all; when it works, you know 
it, and when it doesn’t, that’s obvious too. 

We will adhere strictly to the customary conventions about binomial 
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coefficients and the ranges of summation variables. These are: first that the 
binomial coefficient ()(J vanishes if m < 0 or if £ is a nonnegative integer 
that is smaller than m. Second, a summation variable whose range is not 
otherwise explicitly restricted is understood to be summed from — oo to oo. 
Thus we have, for integer n > 0, 



= 2 r 


in the sense that the sum ranges over all positive and negative and 0 values 
of k, the summand vanishes unless 0 < k < n. and the sum has the value 
advertised. These conventions will save endless fussing over changing limits 
of summation when the dummy variables of summation get changed. For 
example, we find that 



= x~ r (l + x) n , 


for nonnegative integer n and integer r, without ever even thinking about- 
the ranges of the summation variables. 

The series evaluations that are most helpful in the examples that follow 
are, first and foremost, 



(1 - x) k+1 


(k > o), 


(4.3.1) 


which is basically a rewrite of|(2.5.7) 


Also useful are the binomial theorem 



( l + x) n 


(4.3.2) 


and (2.5.11), which we repeat here for easy reference: 


E 


1 

n + 1 



T(i-vT^). 


(4.3.3) 


Example 1. Openers 

Consider the sum 


The free variable is n, so let’s call the sum f(n). Write it out like this: 
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0 

at 


k. now multiply both sides by x n and sum over n. You have now arrived 


step (c) of the general method, and you are looking at 


F(x)=Y, x "T, 


n k> 0 


k 

n — k 


Ready for 


step (d) 


Interchange the sums, to get 


f w = EEL- 


k> 0 n 


k 


n — k 


x 


We would like to ‘do’ the inner sum, the one over n. The trick is to 
get the exponent of x to be exactly the same as the index that appears in 
the binomial coefficient. In this example the exponent of x is n. and n is 
involved in the downstairs part of the binomial coefficient in the form n - k . 
To make those the same, the correct medicine is to multiply inside the sum 
by x~ k and outside the inner sum by x k . to compensate. The result is 

^)=£*‘£ ( n k _ k y- k - 

k > 0 n v 7 


Now the exponent of x is the same as what appears downstairs in the 
binomial coefficient. Hence take r — n ~ k as the new dummy variable of 
summation in the inner sum. We find then 


F ( x ) = J2 xk J2 


k> o 


x 


We recognize the inner sum immediately, as (1 + x) k . Hence 


F(x) = ^ x k (l + x) k = ' S ^2,{x + x 2 ) k 

k> 0 fc>0 


1 

1 — x — X 2 


The generating funct ion on the rig 
bonacci numbers (see 


it is an old friend; it generates the Fi- 
Example 1.3 of chapter 1). Hence f{n) = F n . |_i, and 


we have discovered that 



(n = 0, 1, 2, . . .). 
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Example 2. Another one 

Consider the sum 


E 


n + k \ f2k\ (— 1) 


m + 2kj \ k J k + 1 


(m, n > 0). 


(4.3.4) 


Can it be that the same method will do this sum, without any further 
infusion of ingenuity? Indeed; just pour enough Snake Oil on it and it will 
be cured. Let f(n) denote the sum in question, and let F(x) be its opsgf. 
Dive in immediately by multiplying by x n and summing over n > 0, to get 


F ( x ) = 

n> 0 

= E 

k 

= E 

k 

= E 


n + k \ (2 k\ (— 1) 


m + 2k 
2k\ (-1)* 


k J k + 1 


■x 


w\(-i) k __ k 


k J k + 1 


■x 


E 

n> 0 

E 

r>k 


k J k + 1 

n + k 
m + 2k 


m + 2k 


x 


X 


n-\-k 


2k\ (-l) fc _ k x m+2k 


-x 


k J k + 1 (1 — x) m + 2fc+1 


(by (4.3.1)) 


X 


E 


2k' 


(1 — x) m+1 ^ \ k J k + 1 ((1 — x ) 2 


—x 


— x 


m— 1 


2(1 - x)™- 1 
1 


1 \ 1 + 


4x 


(i — xy 


2(1 -x) 


m— 1 


1 - 


1 + X 

1 — X 


X 


(l-x) m ' 

The original sum is now unmasked: it is the coefficient of x n in the 
last member above. But that is ((( l | ). by (4.3.1) again, and we have our 
answer. See exercise 16 for a generalization of this sum. 

If the train of manipulations seemed long, consider that at least it’s 
always the same train of manipulations, whenever the method is used, and 
also that with some effort a computer could be trained to do it! | 

Example 3. A discovery 

Is it possible to write the sum 


/»= Ef- 1 )' 




n 


,n—2k 


y 


(n > 0) 


(4.3.5) 
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in a simpler closed form? 

This example shows the whole machine at work again, along with a 
few new wrinkles. The first step is to let F ^ {fn}, and try to find the 
generating function F inste ad of th e sequence {/ n }. 

To do that we multiply (4.3.5) on both sides by x n and sum over n > 0 
to obtain 

F(x) = 


n> 0 


fc<§ 


k 


y 


n—2k 


The next step is invariably to interchange the summations and hope. To 
try to make the innermost summation as clean looking as possible, be sure 
to take to the outer sum any factors that depend only on k. This yields 


F(x) = E< 

k 


-i)V M E 


n>2fc 


n 


x n y n . 


Now focus on 


(4.3.1), and try to make the inner sum look like that. If in 


our inner sum the powers of x and y were x" k y n fc , then those exponents 


would match exactly the upper story of the binomial coefficient ( " fc /,: ) , and 
so after a change of du mmy var iable of summation we would be looking 


exactly at the left side of (4.3.1) 


Hence we next multiply inside the inner sum by x k, “ k 


y 


and outside 


the inner sum by x k y k . 


Now we have 


F(x) = E(-i)V 3 W E ( k )x n - k y n - k 

k n>2k ' 

= E(-i)^V‘E (l) l - x y') a 

k a>k ^ ' 

= Ey F k y- k {1 [ X ^ )k+I (by (4.3.1)) 

= — !— El^-f 

1 — xy 1 — xy I 


(4.3.6) 


1 1 

1 - xy 1 + 

1 

1 — xy + x 2 

(Question: Why, after the third equals sign above, did the range of k get 
restricted to l k > O’?) 
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We now expand 
the sum 


(4.3.5) 


(4.3.6) in partial fractions to obtain a closed form for 
This gives 


F(x) 


1 

(1 — XX+)(1 — XX - ) 

X + X- 

(x + — X-)(l — XX+) (x + — X_)(l — XX-) ’ 


where 


x± = 


y 


± Vv 2 - 4 


Hence, for n > 0 the coefficient of x n is 


fn = 


Vy 2 ~ 4 



+ V? 


n+l 



n+l ' 


We now have our answer, but just for a demonstration of the effec- 
tiveness of cleanup operations, let’s invest a little more time in making the 
answer look as neat as possible. Because of the ubiquitous appearance of 
\Jy 2 — 4 in the answer, we replace y formally by x + (l/.x). Then 


Vy 2 


4 = x — — , 
x 


and our formula becomes 
'n — k 




fc<f 


k 


(x 2 + l) n ~ 2k x 2k = 


X 


2n+2 


x 2 — 1 


(n > 0). 


Finally we write t = x 2 to obtain the pretty evaluation 


£(-o' 


t't 


n — k 
k 


(t + \) n ~ 2k t k = 


1 - t n+1 
1 -t 


(n > 0). (4.3.7) 


For instance, the value t = 1 gives 

Tt. — k' 


E(-D ; 


fc<f 


k 


■\n-2k 


= n + l (n > 0). 


(4.3.8) 


As a final touch, we can read off the coefficient of t m in (4.3.7) to 
discover the interesting fact that 



1, if 0 < to < n; 
0, otherwise. 


(4.3.9) 
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Try this identity with n = 2 and watch what happens. f§ 
Here is another example of the same technique. 

Example 4. 

Evaluate the sums 



(n > 0). 


(4.3.10) 


Without stopping to think, let F be the opsgf of the sequence, multiply 
both sides of (4.3.10) by x n , sum over n > 0, and interchange the two sums 
on the right. This produces 


n + k 
2k 


f =E 2_ ‘E 

k n> 0 

= E 2 “*( 2 *)“*E 

k 

= E 2 -fc (2x) 


n> 0 


2 n x n 

n + k 
2k 


\—k 


(2x) 


2k 


k> 0 


(1 -2x) 2k + 1 


(2x) n+k 

(by (4.3.1)) 


1 y{ i V 

1 - 2x ^ Q \ {1 - 2x) 2 J 


1 


1-2x1- 
1 - 2x 


(1 — 2a;) 2 


(1 — Ax)(l — x) 

2 1 

+ 


3(1 — Ax) 3(1 — x) 


It is now a triviality to read off the coefficient of x n on both sides and 
discover the answer: 


( n + n -k _ 

4T 2 k) = 


2 2n +i + i 

3 


(n > 0). 


(4.3.11) 


Example 5. 

Our next example will be of a sum that we won’t succeed in evaluating 
in a neat, closed form. However, the generating function that we obtain 
will be rather tidy, and that is about the most that can be expected from 
this family of sums. 
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The sum is 


fn(y ) = 


n\ f 2k 


k V k 


y k (n > o). 


(4.3.12) 


Follow the usual prescription. Define F(x,y) = fn(y)x n ■ To find 

F. multiply (4.3.12) by x n . sum over n > 0 and interchange the inner and 
outer sums, to obtain 


F{x,y) = 


2k 


y k H 

n> 0 


X 


E( 2 /> 

k v 7 

— T 

1 — x ^ 


X 


(1 — x) k+1 


2k 

k 


xy 

1 — x 


(4.3.13) 


Now since 


E 


2k 


z k = 


\/l —4z’ 


by (2.5.11), we obtain 

F(x,y ) 


(l-x)^/l 


4 xy 
1—x 


y/(l - x )(l - x (l + 4y)) 


(4.3.14) 


(4.3.15) 


For general values of y. that’s about all we can expect. There are two 
special values of y for which we can go further. If y = —1/4, we find that 


E 


2 k\ ( n 


k \k 


'-\r = r 


2 n 
n 


(n > 0). 


(4.3.16) 


If y = —1/2, then 

F(x,~ 1/2) = 1 


— x z 


E 


2m 


m 


(x/2) 


2 m 


(by (2.5.11)). 


Hence we have Reed Dawson’s identity 

e(“)(;)(-d‘*-‘={P' 


if n > 0 is even, 
if n > 0 is odd, 


(4.3.17) 
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and Snake Oil triumphs again. | 


Example 6. 

Suppose we have two complicated sums and we want to show that 
they’re the same. Then the generating function method, if it works, should 
be very easy to carry out. Indeed, one might just find the generating 
functions of each of the two sums independently and observe that they are 
the same. 

Suppose we want to prove that 



(to, n > 0) 


without evaluating either of the two su m s. 

Multiply on the left by x n , sum on n > 0 and interchange the summa- 
tions, to arrive at 



£(: 

k v 

ry.m 

I ^ - 

(1 — x) m+l \ "I" X 
(1 + x) m 
(1 - x ) m+1 ' 


\ _ k x m 
) x (l-x) m+1 


If we multiply on the right by x n , etc., we find 

1 / 2x \ m 

(1 — x)\ 1 — X ) 

( l+x) m 
- (1 - x) m+1 ' 


Hence the two sums are equal, even if we don’t know what they are! j§ 


Example 7. 

There are, in combinatorics, a number of inversion formulas , and gen- 
erating functions give an easy way to prove many of those. An inversion 
formula in general is a relationship that expresses one sequence in terms 
of another, along with the inverse relation, which recovers the original se- 
quence from the constructed one. 

We have already seen a couple of famous examples of these . One is 
the Mobius inversion formula, which is the pair (2.6.11), (2.6.12) Another 
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is the pair (4.2.3), (4.2.7) that occurred in the sieve method. We repeat 
that pair here, for ready reference. It states that if we compute a sequence 
{N r } from a sequence {e r } by the relations 


^ = £ 


t>o 


e t 


(r = 0,1,2,...), 


(4.3.18) 


then we can recover the original sequence (‘invert’) by means of 


e t = 


£(-i y 


-tii 
t 


t,J 'N t (t> 0). 


To give just one more example of such a pair of formulas, consider the 
relation 

V N 


a r = 


E J 6s 


(4.3.19) 


which differs from the previous pair in that the summation is over the lower 
index in the binomial coefficient. How can we find the relations that are 
inverse to (4.3.19)? That is, how can we solve for the 6’s in terms of the 


a’s? 


The answer is that we convert the relation (4.3.18) between two se- 
quences into a relation b etween their exponential generating functions, 
which we then invert. By (2.3.3) we have A(x) = e x B(x), where A and 
B are the egf’s. Hence B ( x) = e ' A ( x ) , and therefore 


*» = £ 


■i)’ 


(n > 0). 


(4.3.20) 



Example 8. Snake Oil vs. hypergeometric functions. 

Many combinatorial identities are special cases of identities in the the- 
ory of hypergeonretric series (we’ll explain that remark, briefly, in a mo- 
ment). However, the Snake Oil method can cheerfully deal with all sorts 
of identities that are not basically about hypergeometric functions. So the 
approaches are complementary. 

A hypergeometric series is a series 

k 
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in which the ratio of every two consecutive terms is a rational function of 
the summation variable k. That means that 

T k+1 _ P(k ) 

T k Q(fc)’ 


where P and Q are polynomials, and it takes in a lot of territory. Many 
binomial coefficient identities, including all of the examples in this chapter 
so far, are of this type. There are some general tools for dealing with such 
sums, and these are very important considering how frequently they occur 
in practice. F or a d iscussion of some of these tools, see, for example, the 


article by Roy [Ro]. 

In this example we want to emphasize that the scope of the Snake Oil 
method includes a lot of sums that are not hyper geometric. Consider, for 
instance, the following sum- 


/( n ) = 


n 

A 

k L J 


Bu 


where the [ ] ’s are the Stirling numbers of the first kind, and the B ' s are 
the Bernoulli numbers. 

Now one thing, at least, is clear from looking at this sum: it is not 
hypergeometric. The ratio of two consecutive terms is certainly not a ra- 
tional function of k. The Snake Oil method is, however, unfazed by this 
turn of events. If you follow the method exactly as before, you could define 
F(x) to be the egf of the sequence {f(n)}, multiply by x n /n\, sum on n, 
interchange the indices, etc., and obtain 


F(x) = ^ 


f{n)x r 


n\ 


n k 


n 

A 

k L J 


Bu 


n 


X 

n\ 






Bk k 




u 


1 — X 


X 


log 


(u = log 

(by (2.5.8)) 

1 


1 — x' 


1 — X 


(by (3.5.3)) 
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If we now read off the coefficient of x n /n\ on both sides, we find t-hat- 
the unknown sum is 


E 


Bh = 


(n — 1)! 
n + 1 


(n > 1). 


(4.3.21) 


Example 9. The scope of the Snake Oil method 

The success of the Snake Oil method depends upon being given a sum 
to evaluate in which there is a free variable that appears in only one place. 
Then, after interc hanging the order of the summations, one finds one of the 


basic power series (4.3.1) or (4.3.2) to sum. 

At the risk of diminishing the charm of the method somewhat by 
adding gimmicks to it, one must remark that in many important cases 
this limitation on the scope is easy to overcome. This is because it fre- 
quently happens that when an identity is presented that has a free variable 
repeated several times, that identity turns out to be a special case of a more 
general identity in which each of the repeated appearances of the free vari- 
able is replaced by a different free variable. Before abandoning the method 
on some given problem, this possibility should be explored. 

Consider the identity 


E 


2 n 

n — i 


3 n 
n 


At first glance the possibilities for successful Snake Oil therapy seem dim 
because of the multiple appearances of n in the summand. However, if we 
generalize the identity by splitting the appearances of n into different free 
variables, we might be led to consider the sum 



which is readily evaluated by the Snake Oil method. It is characteristic of 
the subject of identities that it is usually harder to prove special cases than 
general theorems. Multiple appearances of a free variable are often a hint 
that one should try to find a suitable generalization. 


4.4 WZ pairs prove harder identities 

Computers can now find proofs of combinatorial identities, including 
most of the identities that we did by the Snake Oil method in the previous 
section, as well as many, many more. In this section we will say how that is 
done. Although finding proofs this way requires more work than a human 
would care to do, the result, after the computer is finished, is a neat and 
compact proof that a human can often easily check, and can always check 
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with the assistance of one of the numerous symbolic manipulation programs 
that are now available on personal computers. Hence there is no need for 
blind trust in the computer. One can ask it to find a proof of an identity, 
and one can readily check that the proof is correct. 

These developments are quite recent, and they will surely change our 
attitudes towards, for instance, binomial coefficient identities. Instead of 
regarding each one as a challenge to our ingenuity, we can instead ask 
our computer to find a proof. It will not always succeed, but in the vast 
majority of cases it will. In fact, even more powerful methods are now 
becoming available, which promise a 100% success rate in certain classes of 
identities. 

This doesn’t mean that it was a waste of time to have learned the 
Snake Oil metho d. There were identities that Snake Oil handled that the 
method of [WZ] (which we’re about to discuss) cannot deal with, like the 


fact that J2k>o \n-k ) = ^ G1 n — 0; which was the first example in the 

previous section. Another one that offers no hope to the WZ method is 


(4.3.21) which involves Stirling and Bernoulli numbers. Also, to use the 
Snake Oil method, one doesn’t need to know the right hand side of the 
identity in advance; the method will find it. The method that we are about 
to describe will prove a given identity, but it won’t discover the identity for 
itself. 

With those disclaimers, however, it is fair to say that the method is 
quite versatile, and seems able to handle in a unified way some of the 
knottiest identities that have ever been discovered. 

It stems from some totally obvious facts, concatenated in a slightly 
un-obvious way. Suppose we want to prove that an identity 

^ U(n, k ) = rhs(n ) ( n = 0, 1, 2, . . .) 


is true. The first thing we do is to divide by the right hand side to get the 
standard form 

Y J F{n,k) = l (n = 0,1,2,...). (4.4.1) 

k 

So, in standard form, we are trying to prove that a certain sum is indepen- 
dent of n, for n > 0. 

To do that, rewrite (4.4.1) with n replaced by n + 1 and then subtract 
(4.4.1), to get 


^^{F(n + 1, k) — F(n,k)} = 0 (n = 0, 1, . . .) (4.4.2) 

fc 


Wouldn’t it be helpful if there were a nice function G(n, k) such that 
F(n + 1, k) — F(n, k) = G(n, k + 1) — G(n, k ), (4.4.3) 
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for then the sum (4.4.2) would telescope? In detail, that would mean that 


k=K k=K 

i F (n + 1, k) - F(n, k)} = { G ( n ’ k + x ) ~ G(n, k )} 

k=-L k=—L 

= G(n,K + 1) - G(n, —L). 

Well, as long as we’re wishing, why not wish for G(n, ±oo) = 0 too, for 
then, by letting K, L — > oo we would find that (4.4.2) is indeed true! This 
line of reasoning leads quickly to the following: 

Theorem 4.4.1. (Wilf, Zeilberger [WZ]) Let (F. G) satisfy 
suppose 

lim G(n,k) = 0 (n = 0, 1, 2, . . .). (4.4.4) 

Then the identity 

F(n , k) = const. (n = 0, 1, 2, . . .) 

k 


(4.4.3) 


and 


holds. 


Example 1. 

Suppose we want to prove the identity 


E 


n 


= 2 r 


(n > 0). 


(4.4.5) 


If we d ivide by the right hand side we find that the function F(n,k ) of 
(4.4.1) lis 


n 


F ( n i k ) = 2 n (n > 0). 

Now we need to find the mate G(n, k ) of this F. Well, it is 

(*- 1 ) 


(4.4.6) 


G{n,k) = 


2 n + 1 ' 


(4.4.7) 


That raises two questions. Does this G really work, and where did it come 
from? 


Let’ s take t 
first that (4.4.3) 


le easy one first. To check that it works we need to check 
holds, which in this case says that 


,-n-lfn+l 


- 2 " 


n 


= -2 


—n— 1 


n 


+ 2 


— n— 1 


n 

k - 1 
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After a few moments of work w e can sa tisfy ourselves that this is true. 
Further, the boundary conditions (4.4.4) are easy to verify, and we are all 
finished. ■ 


Definition. We say that an identity 

/ (4.4., 

l) 

is certified by a pair (F. G) 

(‘WZ pair’) if the conditions 

(4.4.3) 

(4.4.4) 

hold. 


(4.4.6), (4.4.7) 


Hence, th e simple identity (4.4.5) is certified by the pair (F. G ) of 


We can cut down still further on the complexity of the apparatus, as 
follows. It will turn out in the class of identities that we are discussing here 
that the mate G will always be of the form G(n, k ) = R(n , k)F(n, k — 1), 
where R(n. k ) is a rational function of n and k. Hence, instead of describing 
the pair (F, G), we need to give only F and R. But F comes directly from 
the identity that we’re trying to prove, just by dividing the summand by 
the right hand side. So if we have the identity in front of us, then the 
rational function R is the only extra certification that we need. 

The class of identities for whi ch the a bove simplification is true is the 

in which the function F has the 


class of those that are of the form 
property that both 

F(n + 1, k ) 
F(n, k ) 


(4.4.1) 


and 


F(n , k — 1) 
F(n, k ) 


are rational functions of n and k. This class includes just about every 
binomial coefficient identity that we have encountered or will encounter. 

Let’s recapitulate the complete proof procedure for an identity that is 
certified by a single rational function R(n, k): 

(a) Given an identity U(n, k) = rhs(n) ( n = 0, 1, 2, . . .), and 

also given a rational function proof certificate R(n, k ). 

(b) Define F(n, k ) = U(n, k)/rhs(n), for n > 0 and integer k. 

(c) Define G(n , k) = R(n, k)F(n , k — 1). 

(d) Check that the conditions (4.4.3) and (4.4.4) are satisfied. 

(e) Check that the identity is true when n = 0. 

(f) The proof of the identity is now complete. | 

Now here are some more examples of the technique in action. Do take 
the time to check one or more of these using the full proof technique (a)-(f) 
given above. 

Theorem. EJ-lf© f‘)4»-‘ = ( 2 „") (n > 0) 

Proof: Take i?(n, k) = (2k — l)/(2 n + 1). | 

Let’s check that statement, one step at a time, following the proof 
procedure above. From step (b) we find that 


F(n, k) 


.CHIhT 

O 
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From step (c) 


we find that 


G(n, A;) = (- 1) 


fe-i 


2k - 1 
2n + 1 


n 

k-l 


2k 

k- 


2\ /^n—k+l 


1 


o 


Now we know the pair (F. G ). In step (d) 


we must first check that the 
condition F(n + 1, k) — F(n, k) — G(ii , k + 1) — G(n, k ) is satisfied. At this 
point it would be very helpful to have a symbolic manipulation program 
available, for then one would simply type in the functions F and G. and 
ask it to verify that the condition holds. Otherwise, it’s a rather dull pencil 
and paper computation of five minutes’ length, and we will omit it. 


The second part of step (d) is the check of the boundary condition 


lim G(n, k) = 0 

k — ^ztoo 


(n = 0,1,2,...) 


That, however, is a triviality, because in this case not only are the limits 0, 
but the function G(n, k ) is 0 for every single value of k > n 1-1 and for all 
values of k < 1. 

In step (e) we quickly check and find that 1=1, and the proof is com- 
plete. | 

Theorem. J-l) fc (™)/( A: + a ) = a/{n + a) (n > 0) 

Proof: Take R(n, k) = k/(n + a). | 

Theorem. = O (" > 0 

Proof: Take R(n, k ) = ~(10n 2 — 6 kn + 17 n + k 2 — 5 k + 7)/(2(2 n — k + 2) 2 ). 


After the last three examples the reader will probably be wondering 
how to find the R(n, /c)’s, instead of just checking that an R(n, k ) snatched 
out of the blue sky seems to work. So we are going to tell that story 
too, because it’s a very important one, not just for these purposes but for 
symbolic manipulation in general. 

The problem is this: the fu nction F (n , k) is known, and we want to 
find G(n, k ) so that the identity (4.4.3) is true. Since F is known, so is 


F{n + 1, k) — F(n, k), so we might as well call it f(n, k ). Next, observe t-hat- 
at this moment the index n is a silent partner. That is, we are looking for 
G so that /(n, k) = G(n, k + 1) — G(n, k ), and we see that k is an active 
index but n is just a parameter, since n has the same value throughout. 
So we might as well suppress the appearance of n altogether, and state the 
problem this way: if fk is a given function of k (and other parameters), 
how can we find gk so that fk = gk+i ~ gk for all integers k ? 

We still don’t quite have the right question, but we’re getting there. 
The question as asked is a triviality. There is always such a gk and it is 


4.4 WZ pairs prove harder identities 


135 


just ^ ■ < k f j . Take the function ft = k, for instance. Then we can take gi~ 

tobe^o <j<kJ- 

To ask the right question we have to add the condition that the sum 
that represents Qk can be done in closed form. This whole problem is about- 
closed forms. What is closed form? Well roughly it means that the answer 
should be pleasant to look at and have no summation signs left in it. That 
idea is too nebulous to work with, so we will use one way of making it 
precise that has proved to be productive. 

Definition. A function fk of the integer k is a hypergeometric term if 
fk+i/ fk is a rational function of k. 

Thus k\ is a hypergeometric term. So is (3 k + 2)!/(5/c — 6)!, and so is 

( 2k ) 

( i \ k ati— k V k )_ 

1 ' m ' 


The function k k is not a hypergeometric term, nor is e 


v/ G The function 


a= E 

0<j<k 



is not obviously a hypergeomet-ric term, nor is it obviously not a hyperge- 
ometric term. The expression serves to define the function but does not 
immediately reveal the nature of the beast. 

Now we’re ready for the right question. 

Let fk be defined for integer k and be a hypergeomet-ric term. Does 
there exist a hypergeometric term gk such that fk = Qk+i -fjk for all integers 


fc? 


[Gos] 


gives a complete 


we input / to Gosper’s 


An algorithm that is due to R. W. Gosper, Jr. 
algorithmic answer to this question. That is to say, i 
algorithm it will then either return a function g with the desired properties, 
or it, will return a guarantee that no such function exists. It will not ever 
return a statement ‘I don’t know.’ We will not describe Gosper’s algorithm 
here because that would take us rather far afield from generating fun ctions. 


However, the reader is ur ged to co nsult either the original reference [Gos 


or the lucid explanation in [GKP], 


Gosper’s algorithm is built in to some of the commercially available 
symbolic manipulation packages. At- this writing (June, 1989) the algorithm 
is fully implemented in Macsyma, where it can be invoked with the nusum 
command, and it is partially implemented in Mathematica. No doubt, it, 
will be more widely available as its usefulness becomes recognized. 

Now here are the st atemen ts and proofs of two more general and dif- 
ficult identities, using the [WZ] method of proof by rational function certi- 
fication. 
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Theorem 4.4.2. (The Pfaff- Saalsch ii tz identity) 
(a + k)\(b + k)\(c — a — b + n — 1 — k)\ 


E 


Proof: Take 


(k + l)!(n — k)\(c + k)\ 

(a — 1)!(6 — l)!(c — a — b — l)!(c — a + n)\(c — b + n)\ 
(c — cl — l)!(c — b — 1 )\(n + l)!(c + n)\ 

(b + k)(a + k) 


R(n , k) 


(c — b + n + l)(c — a + n + 1) 


Theorem 4.4.3. (Dixon’s identity) 

Vf-1 )k( n + b \ (n + c\fb + c 
^ J l n + kj (c+kj ( b + k 


(n + b + c)\ 
nlblcl 


Proof: Take R(n, k) = (c+ 1 — k)(b + 1 — k)/(2(n + k)(n + b + c + 1)). 


4.5 Generating functions and unimodality, convexity, etc. 

The binomial coefficients are the prototype of unimodal sequences. A 
sequence is unimodal if its entries rise to a maximum and then decrease. 
The binomial coefficients { }fc=o do just that. The maximum (‘mode’) 
of the binomial coefficient sequence occurs at k = n/2 if n is even, and at 
k = (n ± l)/2 if n is odd. 

In general, a sequence Co, ci, . . . , c n is unimodal if there exist indices 
r, s such that 

Co E ci E C 2 E ■ ■ ■ E c r — c r -|-i — ■ — c r ~ |_ s A c r ~ i-s+i E ■ ■ ■ E c n . (4.5.1) 

Many of the sequences that occur in combinatorics are unimodal. 
Sometimes it is easy and sometimes it can be very hard to prove that a 
given sequence is unimodal. Generating functions can help with this kind 
of a problem, though they are far from a panacaea. 

A stronger property than unimodality is logarithmic concavity. First 
recall that a function / on the real line is concave if whenever x < y we 
have / ( (x + y)/ 2) > (f(x) + f(y))/2. This means that the graph of the 
function bulges up over every one of its chords. 

Similarly, a sequence Co, ci, . . . , c n of positive numbers is log concave 
if log c /f is a concave function of // , which is to say that 

(logc^-i + log c At _|_i)/2 < logc^. 

If we exponentiate both sides of the above, to eliminate all of the logarithms, 
we find that the sequence is log concave if 

c^-iCp+iEcJ (//= 1,2, ...,n — 1). (4.5.2) 

If, in (4.5.2) we can replace the ‘<’ by ‘<’, then we will say that the sequence 
is strictly log concave. 
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Proposition. Let {c r } g be a log concave sequence of positive numbers. 
Then the sequence is unimodal. 

Proof. If the sequence is not unimodal then it has three consecutive mem- 
bers that satisfy c r _i > c r < c r+ 1 which contradicts the assumed log 
concavity. | 

In many cases of interest, generating functions can help to prove log 
concavity of a sequence, and therefore unimodality too. The source of such 
results is usually some variant of the following: 

Theorem 4.5.2. Let p(x) = C 0 + C 1 X + C 2 X 2 -\ \-c n x n be a polynomial all 

of whose zeros are real and negative. Then the coefficient sequence {c r } q 
is strictly log concave. 

To prove the theorem we need to recall Rolle’s theorem of elementary 
calculus. It holds that if f(x) is continuously differentiable in (a, b), and 
if /(a) = f(b), then somewhere between a and b the derivative f must 
vanish. 

If / is a polynomial this can be considerably strengthened. Let u and 
v be two consecutive distinct zeros of /. Then by Rolle’s theorem there is 
a zero of f in (u, v ). Suppose / is of degree n, has only real zeros, and has 
exactly r distinct real zeros. Then Rolle’s theorem accounts for r — 1 of the 
zeros of f, because we find one between each pair of consecutive distinct 
zeros of /. The remaining n — r zeros of / are copies of the distinct zeros. 
But if xq is a root of / of multiplicity m > 1, then (x — xo) m is a factor of 
/, and so (x — xo) m ~ 1 is a factor of f. Thus xq is a zero of multiplicity 
m — 1 of f'. This accounts for the other n — 1 — (r — 1) = n — r zeros 
of f . In particular, the zeros of f are all real if the zeros of f are. For 
maximum utility in our present discussion, we summarize this discussion in 
the following way: 

Lemma 4.5.1. Let 


f(x, y ) = c 0 x n + cix n 1 y-\ b c n y n (4.5.3) 

be a polynomial all of whose roots x/y are real. Let g(x, y) be the result of 
differentiating f some number of times with respect to x and y. If g is not 
identically zero, then all of its zeros are real. 

Proof of theorem 4.5.2: Since the zeros of / are all negative, we have 

n 

f(x) = Co + C\X H f c n x n = + Xj), (4.5.4) 

j= 1 

where the x 3 ’s are positive real numbers. Hence none of the cfs can vanish. 
Now apply the differential operator to the polynomial f(x,y) 

of (4.5.3). Then only three terms survive, viz.: 


(n - m)c n - r 



(4.5.5) 
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We c an put t his in a cleaner form by writing Cj = ( n )pj , in which case the 


result 


(4.5.5) 


becomes 


n 


m + 1 


(p n — m— 2 X 2 + 2 p 

n—m— 1 xy+p n 

—mV ) • 


But this quadratic polynomial, according to 


lemma 4.5.1 


above, must have 


two real roots, and so its discriminant must be nonnegative, i.e., 


Pn—m — 1 — Pn—m—2Pn—mi 


and the sequence of p’s is log concave. If we substitute back the c’s, we find 
that 

2 ^ (m + 2)(n — m) 


'-'n—m — 1 — 


Cn— m— 2^-n — r 


( m + l)(n — m — 1) 

^ Cn— m— 2 C-n—m-i 

and the strict log concavity is established. | 

Corollary 4.5.1. The binomial coefficient sequence is log con- 

cave, and therefore unimodal. 

Proof. The zeros of the generating polynomial (1 + x) n are evidently real 
and negative. | 

Corollary 4.5.2. The sequence of Stirling numbers of the first kind 


Jk=i 


is log concave, and therefore unimodal. 


Proof. According to (3.5.2), the opsgf of these Stirling numbers is the 
polynomial 


E 



(x + l)(x + 2) ■ ■ ■ (x + n — 1), 


whose zeros are clearly real and negative, igj 

Corollary 4.5.3. The sequence of Stirling numbers of the second kind 
is log concave, and therefore unimodal. 

Proof. We’ll have to work just a little harder for this one, because the 
zeros of the polynomial 


A n(x) = ^ 
J 



X 


j 
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are not easy to find . They a re, however, real and negative, and here is one 


way to see that: by (1.6.8) 


we have the recurrence formula 
A n (y) = {y{ 1 + Ay)Mn-i(y) (n > 0; A 0 = 1), 

which can be rewritten in the form 

e y A n (y) = y (e y A n -i(y))' (rc>0;A o = l). (4.5.6) 


We claim that for each n = 0, 1, 2, . . ., the function e y A n (y ) has exactly 
n zeros, which are real, distinct, and negative except for the one at: y = 0. 
This is true for n = 0, and if it is true for 0, 1, . . . , n — 1, then (4.5.6) and 
Rolle’s theorem guarantee that (e y A n -i(y))' has n — 2 negative, distinct 
zeros, one between each pair of zeros of A„ _ i (y). After multiplying by y, 
as in (4.5.6), we have n — 1 negative, distinct zeros for e y A n (y), but we 
need to find still one more. But e y A n _i(y ) obviously approaches zero as 
y — ec . Hence its derivative must have one more zero to the left of the 
leftmost zero of A n -i(y), and we are finished. | 

The theorem is very strong, but one must not be left with the im- 
pression that unimodality or log concavity has something essential to do 
with reality of the zeros of the generating polynomials. Many sequences are 
known that are unimodal, and have generating polynomials whose zeros all 
lie on the unit circle, and are quite uniformly distributed, in angle, around 
the circle. In such cases our theorem will be of no help. 

For example, an inversion of a permutation a of n letters is a pair 
(i,j) for which 1 < i < j < n, but cr(z) > cr(j). A permutation may have 
between 0 and (”) inversions. It is well known that if 6(n, k ) is the number 
of permutations of n letters that have exactly k inversions, then 

{b(n,k)} k >o (1 + x){l -I- x + x 2 ) ■ ■ ■ (1 + x + x 2 H fx n_1 ). (4.5.7) 


The zeros of the generating polynomi al are very unifo rmly sprinkled around 


the unit circle, so the hypotheses of 


theorem 4.5.2 


are extravagantly vio- 


lated. Nonetheless, the sequence is unimodal; it rises steadily for *<0/2. 
and falls steadily thereafter. 
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4.6 Generating functions prove congruences 

In this section we give one or two examples of the power of the generat- 
ing function method in proving congruences among combinatorial numbers. 
A congruence between two generating functions means that the congruence 
holds between every pair of their corresponding coefficients. 


Example 1. Stirling numbers of the first kind 

We found, in chapter 3, that the Stirling numbers of the first kind [^] 
have the generating function 


E 


x 


x(x + l)(x + 2) • • • (x + n — 1). 


(4.6.1) 


Suppose we are interested in finding some criterion for deciding the evenness 
or oddness of these numbers. 

If we read (4.6.1) modulo 2, it becomes 


E 


fc L J 


n 


x k = x(x + l)x(x + 1) • • • (mod 2) 

= x rn/2] (x + l) Ln/2j . 

Now take the coefficient of x k on both sides, and find that 


n 

k 


[x k ]x^ n W (x + l) Ln/2j 


x k-\n/2\ 


(1 + x) L n / 2 J 


( K2J \ 

\k ~ \n/2y' 


(mod 2) 


(4.6.2) 


(4.6.3) 


Theorem 4.6.1. The Stirling number [)'] has the same parity as the bino- 
mial coefficient ( fc i™{y 2 -|). In particular, [^] is an even number if k < [n/2]. 


Example 2. The other Stirling numbers 

In the case of the Stirling numbers that count set partitions, the {^}’s, 
we found in ( 1 . 6 .5) | that they have the ops generating function 

k 


E 


x n = 


X 


that 


(1 — x)(l — 2x) •■■(! — kx ) ’ 

Again, suppose we read the equation modulo 2. Then we would find 

k 


E 


x 


x' u = 


= X 


(i — x)rfe/2i 

k ^2 ^ fc / 2 l + h — 1 


(mod 2) 


h 


x 
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Now take the coefficient of x n throughout. The result is: 


Theorem 4.6.2. The Stirling number {"} has the same parity as the 
binomial coefficient 


f\k/ 2] + n — k 
\ n — k 


(4.6.4) 


4.7 The cycle index of the symmetric group 

We have already studied the Stirling numbers of the first kind, which 
give the number of permutations of n letters that have exactly k cycles. 
Now we’ll look for much more detailed information about the cycles of 
permutations. Instead of considering only the number of cycles that a 
permutation has, we will be interested in the numbers of cycles that it has 
of each length. 

So let a = {ai, 02, <23, . . .} be a given sequence of nonnegative integers 
for which n = a\ + 202 + 3a3 + • • • is finite. How many permutations of n 
letters have exactly <34 cycles of length 1 and exactly a 2 cycles of length 2 
and etc.? For a given permutation cr, we will call the vector a = a(a) the 
cycle type of a. It tells us the numbers of cycles of each length that a has. 

Let c(a) denote the required number of permutations, and write 

<t>n{ x ) = ^2 c(o)x? 1 X 2 2 • • • • (4-7.1) 

a-^-\-‘2cL‘2-\- • ■ - = n 
a ± ^0,a,2 >0, . . . 


Then 4> n (x) is called the cycle index of the symmetric group S n . If we can 
somehow find (j) n (x) then the coefficient of each monomial x a is the number 
of permutations of n letters whose cycle type is a. 

We are going to find the “grand” generating function 


C(x,t) = J2 1 


[x 


n= 1 


n\ 


(4.7.2) 


which will turn out to have a surprisingly elegant form (see (4.7.5) below), 


considering the large amount of information that it contains. 

The derivation will be unusual in at least one respect. Most often a 
generating function is a way-station on the road to finding an exact formula 
for something. But in this problem we will begin by finding an exact formula 
for c(a). It will then be easy to check that its generating function really 
generates the sequence. 

Well, for a given a, how many permutations a have a for their cycle 
type? We will first prove a lemma, and then give the answer. 
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Lemma A. Given integers m,a,k. The number of ways of choosing ka 
letters from m (distinct) given letters, and arranging them into a cycles of 
length k is 


f(m,a, k) 


ml 

(to — ka)\k a a\ 


(4.7.3) 


Proof. First choose an ordered A’a-t.uple of the letters, which can be done 
in m\/(m — ka)\ ways. Then arrange each consecutive block of k letters in a 
cycle, which gives us our set of a cycles. However, we claim that every fixed 
set of a cycles of length k will arise exactly k a a\ times in this construction. 
Indeed, that set will occur in every ordering of the list of cycles (a! such). 
Furthermore, the same set of a cycles results from each of the k possible 
circular permutations of elements within blocks of k consecutive entries in 
the original A’a-t.uple, i.e., k a times. | 

Hence if we are given n letters, and a sequence of nonnegative integers 
ai, a, 2 , ■ ■ ■ such that 


ai + 2a 2 + 3a 3 H = n, 


then the number of ways of forming these letters into a \ 1-cycles and a 2 
2-cycles, and . . ., is evidently 


f(n, ai, 1 )f(n - a±, o 2 , 2 )f(n - ai - 2a 2 , a 3 , 3) 


n\ 


( n — ai)! 


(n — ai)!l ai ai! J \(n — a 1 — 2a 2 )!2“ 2 a 2 ! 


n\ 


ai!a 2 ! • ■ • 1 “i2“ 2 3“3 


(4.7.4) 


where Lemma A was used. This yields the following explicit formula for 
the number of permutations of each given cycle type. 


Theorem 4.7.1. Let a be nonnegative integers for which j a 3 = n. 
Then the number of permutations of n letters that have a for their cycle 
type is exactly 


c(a) 


n\ 

n ? >i («.?!.)“')' 


Next we’ll look for the generating function of the quantities c(a). Since 
there are infinitely many variables a we shouldn’t be surprised by the need 
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for a generating function in infinitely many variables. We calculate 


C(x,t) = 


4>n(x) 


n> 0 


n\ 


. +n 

v- y 

7l> 0 a 1 +2a 2 + --- = n 

a-± >0,a.2 >0, . . . 


c(a)x 


J2 ET E <a)x a ^x 


n\ 

Tl>0 a 1 +2a 2 H —n 

a-± >0,a,2 >0, . . . 


E 


(txi) 


ai 


E 


a l ry a 2 
2 


«2 


(t 2 x 2 ) 

^ l ai ai! /V ^ 2“ 2 a 2 ! 

ai>0 ' 'a2>0 

= e txi e t2x2 ^ 2 e t3x3 ^ . . . 


/ X 0^ \ 

ex p CL — -)• 

i>i J 


We have proved the following result. 
Theorem 4.7.2. The coefficient oft n /n\ in 


C(x,t) = exp(^^2_) 

k 3 


(4.7.5) 


(4-7.1) 


above, 


possible cycle 


is the cycle index of S n , i.e., the generating function 4> n (x) in 
of the numbers of permutations of n letters that have each 
type. In more detail, the coefficient of x a t n jn\ is the number of permuta- 
tions of n letters whose cycle type is a. f£ 


If this result rather reminds you of the exponential formula, and if you 
suspect that there must be some connection, you are quite correct. The 


result of exercise 22 of the previous chapter is a generalization of theorem 


4.7.2 to exponential families. Indeed, the theorem is an immediate special 
case of the result of that exercise, but we thought it might be interesting 
to give an elementary proof also. 

Thus the generating function C(x,t ) in (4.7.5) generates the cycle in- 
dexes of all of the symmetric groups. We will now give some of its applica- 
tions to the probabilistic theory of permutations. 

The polynomials & n ( x ) of (4.7.1) have coefficients that give the number 


of p ermutati ons of n letters with given cycle type. If we divide them by n\, 
as in 


(4.7.2) then since n\ is the total number of permutations of n letters, 
we will then be finding the probabilities that a permutation has various 
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cycle type vectors. Thus 


C(x, t) 



n 


Y Pn(x)t n 


(4.7.6) 


where 


Pn(x) = E Prob(a, n)x c 


(4.7.7) 


a l ^0> a 2 • • • 


and Prob(a,n) is the probability that a permutation of n letters has the 
cycle type a. 

Just ahead of us now there lie some very pretty theorems. They are 
theorems that give very quantitative answers to questions that don’t seem 
to have any quantities in them. For instance, take this question, which is a 
simple illustration of the genre: what is the probability that a permutation 
has no fixed points? 

Notice that the question doesn’t tell us how many letters the permu- 
tation permutes. There is no ‘ n’ in the question. But it has a nontrivial 


answer: 1/e, as we discovered in (4.2.10). To interpret a question like this, 
one proceeds as follows. Let f(n) be the number of permutations of n letters 
that have no fixed points. Then f(n)/n\ is the probability that a permu- 
tation of n letters has no fixed points. Since, in this case, lim n _ i . 00 f(n)/n\ 
exists and is equal to 1/e, we can then say that, in this precise sense, the 
probability that a permutation has no fixed points is 1/e. 

There are many lovely questions about permutations that sound like 
what is the probability that a permutation has ...?, in which the number of 
letters that the permutations act upon is not even mentioned, and to which 
the answers are nontrivial numbers, like the 1/e above. We are about to 
derive handfuls of them at once. But first we need a lemma. 


Lemma B. Let )Tb bj be a convergent series, 
expansion of the function 


Then in the power series 


1 




we have that lim n a n exists and is equal to V • bj. 


Proof. By 


Rule 5 of section 2.2 


a n is the sum of those bj for which j < n. 


The latter sum clearly approaches the limit stated. jgj 

Now let S be a (finite or infinite) set of positive integers, with the 
property that 


1 
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In the generating function C(x,t) we set all ay = 1 for i ^ S. That means 
that we are declaring ourselves to have no interest in any cycle lengths other 
than those in S. The others can be whatever they please. Then C becomes 


c(x,t ) = exp ( T 
ies i£s 

I (%i — 1 )t l , 1 

exp I 2^ + log 


ies 


1-t 


— ex p(E 


{Xi - 1 )t l 


ies 


Lemma B 


By 
the limit 


above, the coefficient of t n in this last expression approaches 

(Xi - 1 )' 


exp 


E 

ies 


as n — > oo, which proves the following. 


Theorem 4.7.3. Let S be a set of positive integers for which YlseS V s 
converges, and let a be a fixed cycle type vector. The probability that 
the cycle type vector of a random permutation agrees with a in all of its 
components whose subscripts lie in S exists and is equal to 


e -(E ses vs) [a . a ] exp QT-) 

ses s 


1 

Els e5 (e-s°»o s !) 


(4.7.8) 


As a first example, take S = {1}, so we are interested only in fixed 
points. Then from (4.7.8), the probability that a random permutation has 
exactly a fixed points is l/(a!e), for a > 0. 

Take S = {r}. Then we see at once that the probability that a random 

permutation has exactly a r-cycles is 1 / (e“r"a!), for each a = 0, 1, 

Let S = {r, .S'}. The probability that a random permutation has exactly 
a r r-cycles and exactly a s .s- cycles is therefore 

e -l/r-l/s 


OK, now blindfold yourself, reach into a bag that contains every per- 
mutation in the world, and pull one out. What is the probability that none 
of its cycle lengths is the square of an integer? We claim that the probabil- 
ity is e -7r / 6 = .193025 . . ., so the odds are about 5 to 1 against this event. 
Indeed, this is just the case where we take S to be the set of all squares of 
integers, and use the fact that 1 + 1/2 2 + 1/3 2 + ■ • • = 7t 2 /6. 

For a final example, what is the probability that a randomly chosen 
permutation contains equal numbers of 1-cycles and 2-cycles? Well, if that 
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number is j. then the probability is e 3 / 2 / (2 J j ! 2 ), for each j = 0, 1, 2, . . .. 
If we sum over all of these j, we hnd that the required probability is 


3/2 


E 

3=0 


2 3j\ 


12 


= 0.34944033 .... 


We can recast the result in 


theorem 4.7.3 


in the language of the Poisson 


distribution. The Poisson distribution is a probability distribution on the 
nonnegative integers j = 0, 1, 2, . . . that occurs very naturally in a number 
of areas of application, such as in the theory of waiting lines. It is given by 


Prob(j) = e 


-M 




0 ' = 0 , 1 , 2 ,...) 


where M is the mean. 


Theorem 4.7.3 then asserts the following: If a set S is fixed, for which 


S ses 1/s < oo, then for a randomly chosen permutation the numbers 
of cycles of each length s G S have asymptotically independent Poisson 
distributions in which the mean number of s-cycles is 1/s for each s G S. 


4.8 How many permutations have square roots? 

Let a be a permutation. There may or may not be a permutation r 
such that a = r 2 . We want to describe and to count the cr’s that do have 
square roots, in this sense. More generally, a has a kill root if there is 
a r such that a = r fc , and again, we would like to know the number of 
permutations of n letters that have kth roots. 

The answers to these questions involve some fairly spectacular generat- 
ing functions, and the methods will lean stro ngly on the cyc le index results 


of the previous section, and in particular on theorem 4.7.2 
We begin with the square root problem, 
and consider a single cycle of r, say this one: 


So let r be a permutation, 


13 -*• 19 -> 7 -*• 12 ->• 8. 


What happens to that cycle when we square r? The mapping r 2 executes 
the permutation r twice, so it carries 8 into 13 and 13 into 7, etc. Thus 
we go hopping around the cycle of r, visiting every second member, until 
we return to our starting point. A cycle of even length therefore falls apart 
into two cycles of half the length, while an odd cycle remains a cycle of the 
same length, although it becomes a different cycle of that length. 

In the example above, the cycle shown breaks into two cycles, like this: 

8 -> 13 -*• 7 — > 8 and 3 -> 19 -*• 12 -> 3. 


In general, every cycle of r whose length is 2m will contribute two cycles 
of length m to r 2 . 
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A cycle of even length in r 2 , therefore, can only be the result of splitting 
a cycle of twice its length in r into two cycles. Hence, if a has a square 
root, then the number of cycles that it has of each even length must be even. 

Conversely, let a be any permutation that has this property. Then we 
claim that a = r 2 for at least one r. In fact we can construct such a r (in 
how many ways?). To do that, pick up a pair of cycles of the same even 
length and thread them together into a single cycle of twice that length, as 
in the two examples above, by taking alternately a letter from one of the 
cycles and a letter from the other. These threaded cycles are all parts of 
the permutation r that is being constructed. 

What do we do with the odd cycles of cr? If we are given a cycle of 
odd length 2m + 1 in a we convert it into a cycle of the same odd length 
in r as follows. Let the letters in the given cycle of a be 

ai — > 02 — > 03 — > • • • — > a^m — > CL2m+l — 1 ► CLl- 


Then into r we put the cycle 


Oi > O m _|_2 * O2 * O m _|_3 > O3 > O m _|_4 > • • • > ® 2 m+l * 1 - 


This cycle clearly has the property that if we square it then it will be 
back in the original order as in cr, and completes the proof of the following 
theorem (as well as giving us an algorithm for finding the square root of a 
permutation!). 


Theorem 4.8.1. A permutation cr has a square root if and only if the 
numbers of cycles of cr that have each even length are even numbers. Jg| 


Now let /(n, 2) be the number of permutations of n letters that have 
square roots. We seek the generating function of the sequence. Consider the 
cycle type vector a of such a permutation. The even-indexed components 
must be even numbers, and the odd-indexed components are arbitrary. 

According to theorem 4.7.2 the coefficient of x a t n /n\ in the product 


^x\t ^x^t 2 / 2 ^x^t 3 / 3 _ _ _ 


is the number of permutations of n letters whose cycle type is a. The sum of 
these coefficients over all of the cycle types that we are considering, namely 
a’s for which the even-indexed entries are even, is obtained as follows. In the 
product of the exponential functions above, put x\ = 1, because all values 
of ai are admissible. In the second exponential factor, e X2t / 2 , don’t use 
the whole exponential series. Since only even powers of X 2 are admissible, 
use the subseries of even powers of the exponential series, namely the cosh 
series, and then put X 2 = 1. Then put X 3 — 1. Then use the cosh {xyt^/A) 
series and put X 4 = 1, and so forth. 
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The result will be that the number f(n, 2) of permutations of n letters 
that have square roots satisfies 

+n „ _ 

y] f(n , 2)— — e t cosh (t 2 /2)e t ^ cosh (f 4 / 4)e* / 5 • • • 


n> 0 


4.2m 


= exp (£ + f 3 /3 + f 5 /5 -I ) TT cosh ( — — ) 

- LJ - 2m 


m>l 


‘rr f] cosh( * 


2m 


1 — t 1 1 2m 

m> 1 

t 2 t 3 f 4 f 5 

= 1 + t + -r + 3^ + 12^— + 60 y + 


e 

2! ' ”3! ' 24 ' ““5! 

Hence the sequence {/(n, 2)} begins as 

1,1,1, 3, 12, 60, 270, 1890, 14280, . . . . 


(4.8.1) 


Corollary. Let pin ) be the probability that a permutation of n letters has 
a square root. Then for each n = 0,1,2,..., we have p(2n) = p(2n + 1). 

Proof. The generating function in (4.8.1) is of the form 1/(1 — t) times an 
even function of t. Hence f(n, 2 )/n\ is the nth partial sum of the coefficient 
sequence of an even function. | 

More generally, the sequence of probabilities is actually weakly de- 
creasing, i.e. , 


p( 0) = p( 1) > p( 2) = p( 3) > p( 4) = p( 5) > • • • . 

Bijective proofs of these facts have been found by Dennis White (p.c.). 

Now let’s try the question of kth roots. We let f(ri. k ) be the number 
of permutations of n letters that have kth. roots, and we seek the egf of 
{/(n, b)} n >o- Some additional notation will be helpful here. For a prime p 
and an integer n we will write e(p, n ) for the highest power of p that divides 
n. Next, for a pair m, k of positive integers, we define ((m, k)) to be 

((m, k]) = p e (pT) _ 

p\m 


First, if k is given, for which permutations a is it true that there exists 
a permutation r such that a = r fc ? The generalization of theorem 4.8.1 
the following. 


is 


Theorem 4.8.2. A permutation a has a kth root if and only if for every 
m = 1,2, ... it is true that the number of m-cycles that <j has is a multiple 
of ((m, k)). 

To prove this, let a = r k be a permutation of n letters, and suppose 
a has exactly v m cycles of length m, for each m = 1,2, — Consider a 
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cycle of length r in r. In r k this contributes (r, k) cycles of lengths r/ (r, k ). 
Hence the z/ m cycles of length m in a must come from cycles of length r in 
r where r/ (r, /e) = m. From this equation r = m(r, /c) it is easy to see that 
r must be a multiple of m((m, /e)). Hence the cycles of length m in a all 
come from cycles of lengths that are multiples of m((m, k )) in r. But every 
such cycle in r contributes a multiple of ((m, kj) m-cycles in a. Hence the 
number of m-cycles in a must be a multiple of ((m, kj). 

To show that it is also sufficient, let a be a permutation that satisfies 
the condition. We will construct a kth root, r, of a. Fix m, and write 
g = ((m, k )) . Then the number of m-cycles of a is a multiple of g, so we can 
tie them up into bundles of g m-cycles each, and then for each bundle we 
can construct a single new cycle of length mg as follows: construct a circle 
with mg places marked consecutively around it. Take the first m-cycle 
in the bundle and arrange its elements in the marked places, consecutive 
elements being spaced apart by g places. Then do the same for the second 
m-cycle in the bundle, etc. Repeat for each m to complete the proof. | 


Theorem 4.8.2 is due to Arnold Knopfmacher and Richard Warlimont, 
and it corrects an error that appeared in this discussion in the previous 
printing of this book. 


To obtain the egf of the sequence {/(n, k)} we proceed as in (4.8.1) 
above. It will be convenient to have a name for the subseries of the expo- 
nential series that occur. So let us write exp g (x) for the subseries of the 
exponential series e x that is obtained by choosing only the powers of x that 
are divisible by q. That is 


exp g (x) 


E 


x jq 

UqV- 


(q = 1,2,3,...). 


Thusexp 1 (x) = e x , exp 2 (:r) = coshx, exp 3 (x) is explicitly sh own in 


(2.4.7) of chapter 2, etc. Now following the argument that led to (4.8.1) 
obtain this generalization. 


eq. 

we 


Theorem 4.8.3. Let f{n, k ) be the number of permutations of n letters 
that have a kth root. Then we have 


J2f(n,kf-=U^O (k = 1,2,3,...). (4.8.2) 

n= 0 m= 1 


The rea der is in vited to check that this reduces to a triviality when 
k = 1, and to (4.8.1) when k = 2. A short table of f(n, k) (1 < n < 10; 2 < 
k < 7) is shown below. 


150 


Applications of generating functions 


4 


k 

= 2 

1 1 

3 

12 

60 

270 

1890 

14280 

128520 

1096200 

k 

= 3 

1 2 

4 

16 

80 

400 

2800 

22400 

181440 

1814400 

k 

= 4 

1 1 

3 

9 

45 

225 

1575 

11130 

100170 

897750 

k 

= 5 

1 2 

6 

24 

96 

576 

4032 

32256 

290304 

2612736 

k 

= 6 

1 1 

1 

4 

40 

190 

1330 

8680 

52920 

340200 

k 

= 7 

1 2 

6 

24 

120 

720 

4320 

34560 

311040 

3110400 


4.9 Counting polyominoes 

By a cell we will mean the interior and boundary of a unit square in the 
x-y plane, if the vertices of the square are at lattice points (points whose 
coordinates are both integers). Let P be a collection of cells. We associate 
with P a graph, whose vertices correspond to the cells of P, and in which 
two vertices are joined by an edge in the graph if the two cells to which they 
correspond intersect in a line segment (rather than in a vertex, or not at- 
all). We say that P is a connected collection of cells if the graph associated 
with P is a connected graph. 

A collection P of cells is in standard position if all of its cells lie in the 
first quadrant, and at least one of them intersects the y axis and at least 
one of them intersects the x axis. 

A polyomino is a connected collection of cells that is in standard posi- 
tion. 

Here are all of the polyominoes that have one, two, or three cells: 


Sometimes polyominoes are called animals. This is because one can 
imagine a single cell that ‘grows’ by sprouting a new cell along one of its 
edges. Then that two-celled animal would grow a new cell along one of its 
edges, etc. If f(n) is the number of n-celled polyominoes, then from the 
picture above we see that {/(n)} = 1, 2, 6, — It would be good to be able 
to say that in this section we are going to derive the generating function etc. 
for the sequence f{n). We aren’t going to do that, though. The sequence 
and its generating function are unknown, despite a great deal of effort that 
has been invested in the problem. 

Various special kinds of polyominoes, however, have been counted, 
with respect to various properties of the polyomino. For instance, among 
the properties that a polyomino has, one might mention its area, or number 
of cells, and its perimeter. So one might ask for the number of polyominoes 
of some special kind whose area is n, or the number whose perimeter is m, 
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or the number whose area is n and whose perimeter is m, or the generating 
functions of any of these, et c. For a survey of recent progress in such 


questions see [Del] and [De2] 

What we will do in this section will be to count a special kind of 
polyomino that is called horizontally convex (HC). An HC-polyomino is 
one in which every row is a single contiguous block of cells. The picture 
below shows a typical HC-polyomino. 


Another special kind of polyomino is called convex. A polyomino is 
convex if it is both vertically and horizontally convex. One of the striking 
results in the theory of polyominoes is the fact that there are exactly 


(2n + ll)4 n -4(2n + l) 


2 n 


n 


(4.9.1) 


convex polyominoes of perim eter 2 n + 
found by M. Bousquet-Melou 


The number of area n has been 


Bo 


There are interesting problems involved in counting HC-polyominoes 
either by area or by perimeter. We are going to count them here by area. 
It is worth notin g that t he question of enumerating them by perimeter has 
also been solved 
function, which 


[De2],|and the solution involves a remarkable generating 
looks like this: if c n is the number of HC-polyominoes 


whose perimeter is 2n + 2 then 


E 

n> 0 


c„t n = 


vMAC 1 / 3 + D + EC- 1 / 3 ) - F H 


2 VAH 


2y / 2 A 


G 


in which A, B , . . . , H are certain specific functions. For instance A = 
18f 4 (2f 3 — 23f 2 + 38t — 18) 2 . For the complete list see [De2] 

We return to the problem of counting HC-polyominoes by area, which 
is similar to the enumeration of ‘fountains’ in section 2.2, and our method 
of attack will be similar. 

Let /(n, k, t) be the number of HC-polyominoes of n cells, having k 
rows, of which t are in the top row. If we strip off the top row of one of 
these polyominoes, what will remain will have n — t cells, arranged ink — 1 
rows, with some number r > 1 in the top row. Hence after removing the 
top row, there are f(n — t, k — 1, r) possibilities for what remains, for some 
r. However, each one of those possibilities generates r + t — 1 of the original 
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(■ n,k,t ) HC-polyominoes, by adjoining a top row of t cells, and sliding it 
left and right through all legal positions atop the second row. 

Hence we have 

f(n,k,t) = ^2f(n- t,k - 1 ,r)(r + t- 1) (fc > 2; /(n, l,t) = S t , n )- 

r>l 

(4.9.2) 

If we define the generating functions F k ,t(x) = /(n, fc, t)a; n , then we 

have F\j{x) — for t > 1 and after multiplying (4.9.2) by x n and sum- 
ming over n, we obtain 

F k,t( x ) = x l y^(r Ft - l)F fc _i ir (x) (fc > 2). (4.9.3) 

r>l 

Now let f7fc(x) = Y^r>i F k,r( x ) and ^fc( x ') = Er>i r ^tW- Then 
C/i(x) = x/(l — x) and V\(x) = x/(l — a;) 2 . Further, from (4.9.3), 

iM*) = ^(H fc _i(a:) + (t - l)Z7 fc _i(a;)) (A > 2), (4.9.4) 

and if we sum on f we find that 

2 

740*0 = — ^ — Vfe-i(x) + ^ U k -i{x) (fc > 2). (4.9.5) 

1 — x (1 — a?) 

If we first multiply (4.9.4) by t and then sum on t we find 

/v> O /y>2 

= ^-^I4-i0*0 + ^Fn^) (fc > 2). (4.9.6) 

We now have two simultaneous recurrences to solve for the sequences 
U k and Vfc. To do that we eliminate the 14 sequence as follows: solve (4.9.5) 
for I4_i in terms of U k and U k ~i, and substitute the result in (4.9.6). After 
simplification we obtain a single three term recurrence for the I/'s, viz. 

l —^U k+l {x) - ± Uk (x) - — U k . i(s) = 0 (fc > 1), (4.9.7) 

x 1 — a: (1 — a:) 13 

along with the initial data Uq(x) — 0 and U\(x) = x/{\ — x). 

Finally, to solve (4.9.7) we introduce the generating function <p(x, y ) = 
E fc >o^(*)y fc - Then, if we multiply (4.9.7) by y k and sum over fc > 1 we 
get 


x 


<Kx,y) - U x {x)y \ - - 


2 


i 


X 


(1 -a;) : 


:Mx,y) = 0 . 


xy 
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If we use the initial conditions and solve for the result is that 


= Y U k (x)y k = Y f( n > k i r ) 

n,k,r 

xy( 1 — x ) 3 


x n y k 


k> 0 


(4.9.8) 


(1 — a;) 4 — xy( 1 — x — x 2 + x 3 + x 2 y) 


Notice that the sum over r has no variable attached to it; it acts 
directly on /(n, k. r) and yields the number of HC-polyominoes of n cells 
and k rows, without regard to how many cells are in the top row. Thus if 
g{n , k) is that number, then 


Y 9 ^ k ^ 

n,k 


x-y k = 


xy{ 1 — x) c 


(4.9.9) 


(1 — x ) 4 — xy( 1 — x — x 2 + x 3 + x 2 y) 

For the complete 3- variable generating function of the sequence {/(n, k, r)}, 


see 


exercise 21 at the end of this chapter. 


Perhaps we are interested only in the total number of HC-polyominoes, 
and we don’t need to know the number of rows. In that case we let y = I 
in (4.9.8) and we find the following result, which is due to D. Klarner, who 
used different methods. 

Theorem 4.9.1. Let f(n ) be the number of n-celled HC-polyominoes. 
Then 




x n = 


x(l — x) c 


n> 1 


1 — 5a; + 7x 2 — 4a; 3 


= x 2x 2 + 6x s + 19x 4 + 61x b + 196ar + 629ad + 2017a: 8 

+ 6466a; 9 + 20727a; 10 + 66441a; 11 + 212980a; 12 + • • • 

(4.9.9) 

We now will give a preview of the material in chapter 5, by working out- 
an asymptotic formula for f(n). To do this we take the generating function 
in (4.9.9) and expand it in partial fractions. This gives 


x{\ — x)' : 


1 — 5a; + 7a; 2 — 4x 3 


5 x ci c 2 c 3 

16 ^ 4^1 — fix 1 — &X 1 — £ 3 a; ’ 


in which = 3.20556943... and ^ 2>3 = 0.897215 ± ,665457b 
Thus the number of HC-polyominoes is, for n > 2, 


/( n ) = ciff + c 2 ^2 + c 3^ 3 

= ^ + 0 ( 161 ") 

= 0.1809155018 . . . (3.2055694304 . . ,) n + 0(1.1171 n ). 
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The first term of this formula gives, for exa mple, /( 12) 


compared to 212980, the exact value, as shown in (4.9.9) 


212979.61, 


4.10 Exact covering sequences 

Every positive integer n is either 1 mod 2 or 0 mod 4 or 2 mod 4, as 
a moment’s reflection will confirm. So the three pairs (04,61) = (1,2), 
(02,62) = (0,4) and (03,63) = (2,4) of residues and moduli exactly cover 
the positive integers. 

An exact covering sequence (ECS) is a set (a*, 6*) (i = 1 of 

ordered pairs of nonnegative integers with the property that for every non- 
negative integer n there is one and only one i such that 1 < i < k and 
n = Oj mod 6,;. 

In this section we will give the basic theory of such sequences and deal, 
in two or three different ways, with the question of how we can tell if a given 
sequence of pairs is or is not an exact covering sequence. 

Here is what generating functions have to contribute to this subject. 

Suppose ( ai,bi ) (i = 1, . . . , k) is an exact covering sequence. Then in 
the series 

k 

i= 1 t>0 

every nonnegative integer n occurs exactly once as an exponent of x. so it 
must be true that the series shown is equal to 1/(1 — x ) . If we perform the 
summation over t. we find that 


E 


x 


<H 


1 — X bi 


1 

1 — X 


(4.10.1) 


For example 

x 1 x 2 1 

1 — x 2 1 — x 4 ^ 1 — x 4 1 — x 

Theorem 4.10.1. For a set of pairs (a*, 6/ (i = 1, . . . , k) to be an exact 
covering sequence it is necessary and sufficient that the relation (4.10.1) 
hold. 

One conclusion that we can draw immediately is that in an ECS we 
must have JT 1/6 i = 1. To see that, just multiply (4.10.1) by 1 — x and 
let x — > 1. But we can learn much more by comparing the partial fraction 
expansions of the left and right sides of (4.10.1). 

For the left side, we have 


E 


x 


CLi 


1 — x bi 


E 

uj:ui n = 1 


A(uj) 


cv — X 
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where 


A(oj) = lim (uj — x) y — 

x—±oj — J 1 


X 


CLi 


i = 1 
UJ a i +1 

1 ^T’ 

j:uj 3 =1 


X 


bi 


E 


and IV = l.c.m.{6j}. If we compare with the right side of (4.10.1) we see 
that the H(w)’s must all vanish except that A( 1 j = 1. Hence we must have 


v u) aj f 1 , if uj = 1; 

bj I 0, otherwise. 

b „• n J ^ 

j:tj 3 =1 


Now H(w) surely vanishes unless u is a root of unity. So let u = e 27rir / s , 
where s > 1 and (r. s) = 1, be a primitive sth root of unity. Then our 
conditions take the form 


E 

r-s\bj 



1 if s = 1; 

0 otherwise. 


(4.10.2) 


Hence, associated with any sequence of pairs (oj,6j) |^ =1 we can define 
polynomials 


ips(z)= ~k~ (s = !,2,3, ...). 


(4.10.3) 


j:s\bj 


In terms of these polynomials we can restate our conditions (4.10.2) as 
follows: necessary and sufficient for the given set of pairs to constitute an 
exact covering sequence is that for each s > 1 the polynomial ijj s should 
vanish at the primitive sth rots of unity, and ?/> i(l) = 1. 

However, any polynomial that vanishes at all of the primit ive sth roots 
of unity must be divisible by the cyclotomic polynomial 
example 2) 


see 


section 2.6, 


n 

r:(r,s) = l;0<r<s 


2nir/s 


) (s = 1,2,3,...) 


since d> s (z) has only those roots. The first few cyclotomic polynomials are 

1 — z, 1 T -2, 1 4 2 4 z 2 1 1 T z 2 , 1 T z T z 2 T z 2, T z^, 1 — z T z 2 , .... 


If we put all of this together, we obtain the following result. 
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Theorem 4.10.2. A set of pairs of integers (aq, b \), . . . , (a*,, bk), in which 
the a’s are nonnegative and the b’s are positive, is an exact covering se- 
quence if and only if JT 1/bj = 1 and for each s > 1, the polynomial 
of (4.10.3), is divisible by the cyclotomic polynomial $ s (z). g 

For an example, take the pairs 

(0,4), (2, 4), (1,6), (3, 6), (5, 12), (11, 12). 

Then J2j 1 /bj = 1/4 + 1/4+1/6 + 1/6 + 1/12 + 1/12 = 1, and the divisibility 
conditions of the theorem look like this: 


$ 2 (^) = (1 + z) divides 

4*3(2:) = (1 + 2: + z 2 ) divides 
$4(2:) = (1 + z 2 ) divides 

& 6 (z) = (l — z + z 2 ) divides 
4*12(2:) = (1 - z 2 + z 4 ) divides 


| + z 2 / 4 + z/6 + z 3 / 6 + z 5 /12 + z 11 / 12 
z/6 + z 3 /6 + z 5 /12 + z n /12 
± + z 2 /4 + z 5 /12 + z u /12 
z/6 + z 3 /6 + z 5 /12-I-z 11 /12 
z 5 /12 + z u /12. 


These are all readily checked, and so the given pairs are an ECS. 

Theorem 4.10.2 has a number of corollaries, some of which are left as 
exercises. One of them, however, is quite clear. If B = max { bj } . then B 
cannot occur just once among the moduli {bj}. Indeed, if we take s = B 
in the theorem, we discover that ipB (z) must have enough monomials in 
it to allow it to be divisible by 4*5(2:), so it must surely have at least two 
monomials. 
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Exercises 


1. Given a coin whose probability of turning up ‘heads’ is p, let p n be the 
probability that the first occurrence of ‘heads’ is at the nth toss of the coin. 
Evaluate p n and the opsgf of the sequence {p n }- Use that opsgf to find the 
mean of the number of trials till the first ‘heads’ and the standard deviation 
of that number. 


2. In the coupon collector’s problem we imagine that we would like to get 
a complete collection of photos of movie stars, where each time we buy a 
box of cereal we acquire one such photo, which may of course duplicate one 
that is already in our collection. Suppose there are d different photos in 
a complete collection. Let p n be the probability that exactly n trials are 
needed in order, for the first time, to have a complete collection. 


(a) Show that 


Pn = 


d n 


wher e is the Stirling number of the second kind (see section 

1.6)[ 

(b) Let p{x) ^ {Pn}- Show that 


p(x) 


(d — l)!x d 

(d — x)(d — 2x) ■ ■ ■ (d — (d — l)x) 


(c) Find, directly from the generating function p(x), the average num- 
ber of trials that are needed to get a complete collection of all d 
coupons. 

(d) Similarly, using p(x), find the standard deviation of that number 
of trials. 

(e) In the case d = 10, how many boxes of cereal would you expect to 
have to buy in order to collect all 10 different kinds of pictures? 

3. ( First return times on trees ) By a random walk on a graph we mean 
a walk among the vertices of the graph, which, having arrived at some 
vertex v , goes next to a vertex w that is chosen uniformly from among the 
neighbors of v in the graph. 

If T is a tree, and v is a vertex of T, let p(j] v] T) denote the probability 
that a random walk on T which starts at vertex v. returns to v for the first 
time after exactly j steps. 

Now let Ti, T 2 be trees, let v % be a vertex of T, for i = 1,2, and let 
T be the tree that is formed from these two by adding edge (ui,u 2 ). 
Finally, let Fi(x;vi), u 2 ), F{x]V\ ) be the opsgf’s of the sequences 

{p(j;m;Ti)}j>o, {p(j-,v 2 ]T 2 )}j> o, and {p(j\ m; T)}j> 0 , respectively. 
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(a) Show that 


F{x;v i) 


1 


d\ + 1 


dxF^vi) + 


x 


d 2 -\- 1 — d 2 F 2 [x] v 2 ) 


5 


where <7, is the degree of vertex v t in the tree T r . for % = 1,2. 

(b) Let ht(cl) be the average number of steps in a random walk that 
starts at vertex a E T and stops when it returns to a for the first 
time. Show, by differentiating the answer to part (a), that 


dr(vi) = (2 + di^nOi) +d 2 HT 2 (v2))- 

d i + i 


(c) Let the tree T be a path of n + 1 vertices. Show that the mean 
return time of a walk that begins at vertex v is 2 n if v is one of 
the two endpoints, and is n for all other v (surprisingly?). 

(d) Again, if T is a path of n vertices, and if P n (x ) denotes the gener- 
ating function F(x; V\) of part (a), where V\ is an endpoint of T, 
then find an explicit formula for P n {t). 

4. Find, in terms of N(x), the opsgf of the sequences {e< m } (resp. {e> m }) 
which count the objects that have at most m properties (resp. at least m 
properties). 


5. What chessboard would you use to derive the number of permutations 
that have no fixed points? Rederive the formula for this number using the 
chessboard method. 


eq. (4.2.6) 


computes the 


6. ( Bonferroni’s inequalities) In the sieve method, 
number of objects that have no properties at all. Suppose the alternating 
series on the right were cut off after a certain value t = m, say. Show that 
the result would overestimate eo if m were even, and underestimate it for 
m odd. 


To do this, show that the sequence 


am=J2 (-! Y~ m N r (rn = 0 , 1 , 2 , . . .) 

r>m 

has the opsgf eo + x ^2 r>0 e r +i(x + l) r , whose coefficients are obviously 
nonnegative. 

7. ( Bonferroni’s inequalities, cont .) Not only is eo alternately under- and 
overestimated by the successive partial sums of its sieve formula, the same 
is true of every efc, the number of objects that have exactly k properties. 
To show this generatingfunctionologically, define, for each M> o, 


7 (M) = (-1) 


t + 1 


efc - 

:i<t 


k+j 

j 


Ni 


k+j 
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Then the problem is to show that all 7 (k, t ) > 0. 
To do this, 


(a) let T(x,y) be the 2-variable opsgf of the 7 ’s. Then multiply the 
definition of the 7 ’s by x k y t , sum over k,t > 0 , and show that 


r (x,y) 


E(x + (1 + y)) - E(x ) 

(1 + 2 /) 


(b) It now follows that the 7 's are nonnegative, and in fact that 

7(M) = \ l \r (M> 0). 

r>fc V / \ / 


8 . Show that 

=( i+ z>( i+ * 2 >"- 

Then use Snake Oil to evaluate 

?(3(ivj )■* 

explicitly, when y = ±2 (due to D. E. Knuth). Find the generating function 
of these sums, whatever the value of y. 

9. Let G be a graph of n vertices, and let positive integers x, A be given. 
Let P(A; x: G) denote the number of ways of assigning one of A given colors 
to each of the vertices of G in such a way that exactly x edges of G have 
both endpoints of the same color. 

Formulate the question of determining P as a sieve problem with a suitable 
set of objects and properties. Find a formula for P(X;x;G), and observe 
that it is a polynomial in the two variables A and x. The chromatic poly- 
nomial of G is P( A; 0; G). 

10 . 

(a) Let w be a word of m letters over an alphabet of k letters. Suppose 
that no final substring of w is also an initial string of w. Use the 
sieve method to count the words of n letters, over that alphabet 
of k letters, that do not contain the substring w. 

(b) Use the Snake Oil method on the sum that you got for the answer 
in part (a). 

11. Use the Snake Oil method to do all of the following: 

(a) Find an explicit formula, not involving sums, for the polynomial 
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(b) Invent a really nasty looking sum involving binomial coefficients 
that- isn’t any of the ones that we did in this chapter, and evaluate 
it in simple form. 

(c) Evaluate 

f 2n+l \ fp + k\ 


^ \2p + 2k + lj\ k 

and thereby obtain a ‘Moriarty identity.’ 

(d) Show that 

* M r 


m \t — m 


r + s 


Then evaluate 


(e) Show that (Graham and Riordan) 


2n + 1\ (m + k\ ( 2m + 1 


(f) Show that for all n > 0 


n\ fk 


k ) \J 


x k = 


(1 + x) n K 


(g) Show that for all n > 0 


n + k\ fx 2 ~l\ n ~ k (x- l\ 2n+1 fx + l\ 2n+1 


(h) Show that for n > 1 


n + k — 1\ {x — l) 2k x n k _ ( x n — l) 2 
2k — 1 ) k n 


12. The Snake Oil Method works not only on sums that involve binomial 
coefficients, but on all sorts of counting numbers, as this exercise shows. 

(a) Let {a n } and { b n } be two sequences whose egf’s are, respectively, 
A(x ), B(x). Suppose that the sequences are connected by 

b n = 2_^ h ttk V n > 0), 
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where the [ ] ’s are the Stirling numbers of the first kind. Show 
that their egf’s are connected by 


B(x) = A ( log 


(1 ~x)J ' 


(b) Let b n be the number of ordered partitions of [n] (see 
Show that 


(5.2.7) 


E 


bk = n!2 


71—1 


(n > 1). 


(c) Let { a n } be the numbers of derangements (= hxed point free 
permutations) of n letters, and let {b n } be defined as in part (a). 
Show that 

<"»> - 1+ * ( r-,r 

(d) Repeat parts (a)-(c) on the Stirling numbers of the second kind, 
and discover a few identities of your own that involve them. 

(e) Generalize parts (a)-(d) to exponential families. 

13. Prove that 


E(-d 


n—k 


2 n 
k 


2 n 
n 


by exhibiting this sum as a special case of a sum with two free parameters, 
and by using Snake Oil on the latter. 

14. To do a sum that is of the form 


Sin) = ^2f{k)g(n - k ), 


the natural method is to recognize S(n) as [x n ]{i 7 '(a:)G(x)}, where F and 
G are the opsgf’s of {/ n } and {g n }- Use this method to evaluate 


s w = Erir 


2k 


1 


k ) n — k + 1\ n — k 


2 n — 2k 


15 . 


(a) Prove the following generalization of (4.2.19) 


and show that it is 


indeed a generalization. For all m,n,q > 0, we have 
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(b) The Jacobi polynomials may be defined, for n > 0, by 


Pt b \x) = v 


■ \ / . 7 \ / -1 \ Ti h / . -j \ k 

n + a\ f n + b\ x — 1 \ / x + 1 


k 


n — k 


Use the result of part (a) to show also that 

'n + a + b + j\ fn + a\ ( x — 1 x J 


Pj b) {x) = XI 


J 


j + a 


(c) Use the result of part (b) and a dash of Snake Oil to show that 


Pj b Hx) = 2-"(* - 1)-“ [i" + “ + ‘] { (1 *1 


n+a 


16. Prove the following generalization of the sum in example 2: 
sequences {/ n } and {c/ i: } are connected by the equations 

(n > 0), 


if two 


/ n + k 

Jn ~ / , ( . m I c k 


m + 2k 


where m > 0 is fixed, then their opsgf’s are connected by 


F(x) = 


x 


C 




X 


Say exactly what was special about the sequence {cfc} that was used in 
example 2 that made the result turn out to be so neat in that case. 

17. T he purpose of this problem is to show the similarity of the method of 
[WZ] to some well known continuous phenomena. 

(a) Let F(x,y), G(x,y ) be differentiable functions that satisfy the 
conditions that F x — G y and liniy^-i-oo G(x, y) = 0, for all x in a 
certain interval a < x < b. Show that we have the ‘identity’ 


F(x,y)dy = const. (a < x < b). 


(b) Show, using the result of part (a), that if f(z ) is analytic in the 
strip — oo<a<3fJ^<6<oo, and if / — > 0 on all vertical lines in 
that strip, then the conclusion of part (a) holds, where F(x,y) is 
the real part of f(z). 

(c) Show that the result stated in part (b) is true without using the 
result of part (a), but using instead the Cauchy integral theorem 
applied to a suitable rectangle. 
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(d) Apply these results to f(z ) = e z 
tity’ 

r°° 2 

/ e~ y cos (2 xy)dy 


and thereby discover the ‘iden- 
= ce~ x ( x real), 


2 

which states that the function e~ y is its own Fourier transform. 
Find c. 


18. This problem gives a neat proof of Cayley’s formula for the number of 
trees of n vertices, by showing more, namely that there is a pretty formula 
for the number of such trees even if the degrees of all vertices are specified. 

(a) Let di , . . . , d n be positive integers whose sum is 2 n — 2. Show that 
the number of vertex-labeled trees T of n vertices, in which for all 
i = 1, . . . , n it is true that di is the degree of vertex i of T, is exactly 


fn ( d 1 , • • • , d n ) 


(n — 2)! 

(di — l)!(d 2 — !)!■■■ (d n — 1)! ’ 


(Do this by induction on n. Show that one of the di s, at least, must 
be =1, and go from there.) 

(b) Find the generating function 


Fn ('ll , • • • , Xn) — ^ ^ fn (d \ , • • • , djfjX^ • • • X n ” 

d \ H \~dyi =2n — 2 

di , . . . , df i > 1 


in a pleasant, explicit form, involving no summation signs. 

(c) Let Xi = X 2 = ■ ■ ■ = x n = 1 in your answer to part (b), and thereby 
prove Cayley’s result that there are exactly n n ~ 2 labeled trees of n 
vertices. 

(d) Use the sieve method to show that if is the number of vertex 
labeled trees on n vertices of which k are endpoints (vertices of degree 
1), then 



(e) 


Show that the average number of endpoints that trees of n vertices 
have is 





n 

e 



oo), 


i.e. , the probability that a random vertex of a tree is an endpoint is 
about 1/e. 
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19. 

(a) If N(C. S ) is the number of objects whose set of properties is con- 
tained in S , then for all sets T, the number of objects whose set of 
properties is precisely T is 

N(= T)=J2 (~l) |5HT| iV(C 5). 

SCT 


(b) Let S' be a fixed set of positive integers, and let h n (S ) be the number 
of hands of weight n , in a certain labeled exponential family, whose 
card sizes all belong to S. Find the egf of {h n (S)}. 

(c) Multiply by (— 1)I' S 'H T I and sum over S C T to find the egf of 
{^n(T)}n> 0, the number of hands whose set of distinct card sizes is 
exactly T, in the form (the d ' s are the deck sizes) 


E 

n> 0 


M?) 

\~ X 

n.\ 



(d) Let p(n , k) be the number of hands of weight n that have exactly k 
different sizes of cards (however many cards they might have!). Sum 
the result of (c) over all \T\ = k, etc., to find that 


E 

n,k> 0 


71 ,! 



(e) Let c n be the average number of different sizes of cycles that occur 
in permutations of n letters. Show that the opsgf of {c n } is 

rbE (i--'* )• 

t> i v 7 

and find an explicit formula for c n . 

20. Begin with the set {1,2, ...,n}. Toss a coin n times, once for each 
member of the set. Keep the elements that scored ‘Heads’ and discard the 
elements that got ‘Tails’. You now have a certain subset S of the original 
set. Call this whole process a ‘step’. Now take a step from S. That is, 
toss a coin for each element of S, and keep those that get ‘Heads’, getting 
a sub-subset S ' , etc. The game halts when the empty set is reached. Let 
f(n, k, r ) be the probability that after k steps, exactly r objects remain. 

(a) Find a recurrence relation for /, find the generating function for /, 
and find / itself. 

(b) What is the average number of steps in a complete game? 
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(c) What is the standard deviation of the number of steps in the game? 

let f(n, k. t ) be the number of HC-polyominoes that 


section 4.7 


21. As in 

have n cells, in k layers, the highest layer consisting of exactly t cells. Show 
that the ‘grand’ three-variable generating function is 


]T/(n,MK</V = 


n,k,t 


xyz( 1 — x) 2 ((l — xz)( 1 — x) 2 + x 2 y{z — 1)) 

(1 — xz) 2 (( 1 — x ) 4 — xy( 1 — x — x 2 + x 3 + x 2 y)) 


Note that you do not have to start from scratch here, but instead you can 
use the results of 


section 4.7 


22. Let (ai, &i), . . . , (a*,, b^) be an exact covering sequence. Show that 


E v = V2= E l 


j'.dj is even 


j:aj is odd ^ 


Generalize this result to other residue classes for the aj. 

23. What is the probability that a random permutation has equal numbers 
of r-cycles and s-cycles? Express your answer in terms of Bessel functions 
(see chapter 2). Make a table of your answer, as a function of r and s, for 
1 < r < s < 6. 


24. Find a three term recurrence relation, whose co efficient s are polyno- 
mials in n, that is satisfied by the quantity shown in (4.8.1) which is the 
number of convex polyominoes of perimeter 2 n + 8. 


25. For (di,bi) \\ =1 to be an exact covering sequence it is necessary and 
sufficient that for all n such that 0 < n < N. n is congruent to a, mod b t 
for exactly one i, where N is the least common multiple of b\ . . . . . 6/,.. 


26. 


a 


Develop the generalization of the exponential formula that we were 
really using in section 4. 7] Precisely, suppose that for each i = 
1, 2, 3, . . . we are given a set S, t of positive integers. Let h{n ) be the 
number of hands of weight n that can be formed from a given collec- 
tion of decks if our choices of cards are restricted by the condition 
that for each i = 1, 2, 3, . . ., the number of cards of weight i that are 
chosen for the hand must lie in the set Si. Then show that 


EM<=n«p s (f). 

n> 0 i= 1 


where exp s (x) is the subseries of the exponential series whose indices 
lie in the set S and, as in chapter 3, di is the number of cards in the 
7th deck. 
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(b) Find the egf of {f(n)}, where f(n) is the number of partitions of the 
set [n] in which the number of classes of size 2 is divisible by 2 and 
the number of classes of size 3 is divisible by 3, etc. 


27. In order that (oj,6i ) \^ =1 be an exact cover ing sequence of residues and 


moduli, it is necessary and sufficient that [Fr] 


= (11 = 0 , 1 , 2 ,...) 


i= 1 


where the {B n } are the Bernoulli numbers (defined by (2.5.8)) 
B n (x ) are the Bernoulli polynomials, defined by 


and the 


te xt 

e 1 — 1 


E 

n=0 


B n (x)t r 


n\ 


28. Find a formula for the number of square roots that a permutation has. 
What kind of a permutation has a unique square root? 


5.1 The Lagrange Inversion Formula 
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Chapter 5 

Analytic and asymptotic methods 


In the preceding chapters we have emphasized the formal aspects of 
the theory of generating functions, as opposed to the analytic theory. One 
of the attractions of the subject, however, is how easily one can shift gears 
from thinking of generating functions as mere clotheslines for sequences to 
regarding them as analytic functions of complex variables. In the latter 
state of mind, one can deduce many properties of the sequences that are 
generated that would be inaccessible to purely formal approaches. Notable 
among these properties are the asymptotic growth rates of the sequences, 
which are probably the main focus of the analytic side of the theory. Hence 
in this chapter we will develop some of the analytic machinery that is invalu- 
able for such studies. For an introduction to asymptotics and de finition s of 
all of the symbols of asymptotics, see, for example, chapter 4 of 


Wil 


5.1 The Lagrange Inversion Formula 

The Lagrange Inversion Formula is a remarkable tool for solving certain 
kinds of functional equations, and at its best it can give explicit formulas 
where other approaches run into stone walls. The form of the functional 
equation that the LIF can help with is 

u = t<p(u). (5.1.1) 


Here <j) is a given function of u , and we are thinking of the equation as 
determining u as a function of t. That is, we are ‘solving for u in terms of 
t: 

We found one example of such an equation in 


section 3.12. 


There we 

saw that if T(x ) is the egf for the numbers of rooted labeled trees of each 
number of vertices, then T(x) satisfies the equation T = xe T , which is 
indeed of the form (5.1.1), with </>(«) = e u . 

The general problem is this: suppose we are given the power series 
expansion of the function <j) = (Hu), convergent in some neighborhood of 
the origin (of the 'U- plane) . How can we find the power series expansion of 
the solution of (5.1.1), u = 'u(f), in some neighborhood of the origin (in the 
f-plane)? The answer is surprisingly explicit, and it even allows us to find 
the expansion of some function of the solution u(t). 


Theorem 5.1.1. (The Lagrange Inversion Formula) Let f(u) and (p(u) be 
formal power series in u , with </>(0) = 1. Then there is a unique formal 
power series u = u(t) that satisfies (5.1.1). Further, the value f(u(t )) of f 
at that root u = u{t), when expanded in a power series in t about t = 0, 
satisfies 


m {/wi))} = iK- 1 ]{/'(«w«r}. 

/ i 


(5.1.2) 
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Proof. First we note that it suffices to prove the theorem in the case 
where / and cp are polynomials. Indeed, if n is fixed, and if / and cp are 
full formal power series, then suppose that we truncate both of those series 
by discarding all terms that involve powers u k for k > n. If the result is 
true for these polynomials then it remains true for the original untr uncatec 


series, since the higher order terms that were discarded do not affect (5.1.2) 
for the fixed n, at all. 

Therefore we now suppose that / and (p are polynomials. 

We will first make a formal computation, and then discuss the range 
of validity of the results. We have 


K -1 ] {/»</>«} = [u 71 - 1 ] {f\u)(u/t) n } 

= {/'(«)/*”} 

= J_ f /'(«) 

2ni J 

= -/ 

2ni I 


. . -du 
t(u) n 

t~ n f' (u(t))v! (t)dt 


(5.1.3) 


= [ t n ] {t(d/dt)f(u(t))} 
— n [t n ] f(u(t)). 


In the above, the first equality comes from (5.1.1) the second is trivial, 
and the third is the residue theorem of complex integration, in which the 
integrand is a function of u and the contour is, say, a small circle enclosing 
0 . 

The fourth equality needs some discussion. Consider the behavior of 
the function g(u) = u/cp(u ) near the origin. Since (p(0) = 1, the function (p 
remains nonzero in a neighborhood of 0. Hence g is analytic there, and it 
has a power series development of the form u + cu 2 f • • • . It follows that g 
is a 1-1 conformal map near 0. Hen ce it has a well defined inverse mapping 
that is itself analytic near 0. Thus (5.1.1) has a unique solution u = u(t) 


in some neighborhood 3? of t = 0, and u is an analytic function of t there. 
If the contour of integration in the integral that appears after the fourth 
equals sign above is a circle around t = 0 that lies in 3?, then the sign of 
equality is valid simply as a change of variable from u to t in the integral. 
The remaining equalities are trivialities, and the proof is complete. | 

Example 1. 


In section 3.12 we embarked on proving 


theorem 3.12.1 


to the effect 

that there are exactly n n ~ 2 labeled trees of n vertices. We found that if 
D(x ) paJP {t n }i where t n is the number of rooted labeled trees of n vertices, 
then T>(x) satisfies the functional equation 


V(x) 


xe 


V(x) 


(5.1.4) 
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we can actually solve 


[. x n } V{x) 


(1/n) 

y- 1 ] {4,(u) n } 

(1 In) 

[ u n “ 1 ] {e nu } 

(1/n) 

n n_1 

(n — 1)! 

n n ~ 1 


n! 



Hence, 


tn 

n\ 


n 


n—1 


n\ 


and t n = ri n But t n is the number of rooted trees of n vertices, and every 
labeled tree contributes n rooted labeled trees, so the number of labeled 


trees of n vertices is n n 2 , which completes the proof of theorem 3.12.1 


Example 2. 

At the end of chapter 4 we discussed inverse pairs of summation for- 


mulas and gave some examples beyond the Mobius inversion formula. Here 


we’ll derive a much fancier example with the help of the LIF. We propose 
to show that if two sequences {a n } and { b n } are related by 


*>» = £ 


k 

n — k 




(5.1.5) 


then we have the inversion 


TlCLn. — 


£ 


2n — k — 1 
n — k 


■ 1 ) 


n ~ k kb k . 


Indeed, if (5.1.5) holds, then, as we did so often in 


section 4.3 


(5.1.6) 


multiply 


by x n . sum on n. and interchange the k and n summations, to get 


fl(*) = £a k z‘£( * )*"-* 

k n ' ' 

= y^Q fc X fc (l +x) k 
k 

= A(x( 1 -I- x)), 


(5.1.7) 


where A, B are the opsgf’s of the sequences. Now we can solve for A in 
terms of B by setting y = x + x 2 . Then if we read (5.1.7) backwards, we 
find that 


A{y) = B(x(y)), 


(5.1.8) 
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where x(y) is the solution of the quadratic equation y = x+x 2 that vanishes 
when y = 0. 

Now we could , of course, solve the quadr atic exp licitly. If we were to 
do that (which we won’t) we would find from (5.1.8) [that 


A(y) = B 


yr +%-1 


and then we would want to find a nice formula for the coefficient of y n on 
the right hand side for every n. That, in turn, would require a nice formula 
for 


[y n ] 


v / T +%-1 


(5.1.9) 


for every n and k. Instead of trying to deal with (5.1.9) by explicitly raising 
the quantity in braces to the kth power and working with the square root, 
it is a lot more elegant to let the LIF do the hard work. 

We begin by rephrasing the question (5.1.9) implicitly , rather than 
explicitly. What we want is 


where x = x(y) is the solution of y 
the LIF, we write the equation as 


[y n }{x(y) k }, 

x + x 2 that is 0 at y = 0. In terms of 


x = 


y 


1 + x 1 


which is of the form (5.1.1) with <f(u) = 1/(1 + u ). Further, since we want- 
the coefficients of the kth power of x(y), the function f(u) in the LIF is 
now f(u) = u k . 


The conclusion (5.1.2) 


of the LIF tells us that 
kx k ~ x 


[y n ]{x(y)} k = (l/n)^- 1 ] 
= ( k/n)[x n ~ k ] 


(1 + x) 7 

1 


(1 + x) 7 


= (fc/n)(-l) 


n—k 


2n — k — 1 
n — k 


and we are all finished with the proof of (5.1.6) 


It should be particularly noted that the LIF is as adept at computing 
the coefficients of the kth power of the unknown function as those of the 
unknown function itself. The function / in the statement of the LIF simply 
specifies the function of the unknown function whose coefficients we would 
like to know. The LIF then hands us those coefficients. 


5.2 Analyticity and asymptotics (I): Poles 
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5.2 Analyticity and asymptotics (I): Poles 

Suppose we have found the generating function f(z) for a certain se- 
quence of combinatorial numbers that interests us. Next we might want to 
find the asymptotic behavior of the sequence, i.e., to find a simple function 
of n that affords a good approximation to the values of our sequence when 
n is large. 

The first law of doing asymptotics is: look for the singularity or sin- 
gularities of f(z) that are nearest to the origin. The reason is that f(z) is 
analytic precisely in the largest circle centered at the origin that contains 
no singularities, and we find the radius of that circle by finding the singu- 
larities nearest to the origin. Once we have that radius, we have also the 
radius of convergence of the power series f(z). Once we have the radius of 


convergence we know somet 
is large, as in 


ring about the sizes of the coefficients when n 
By various refinements of this process we can 


theorem 2.4.3. 

discover more detailed information. 

Therefore, in this and the following sections we will study the influence 
of the singularities of analytic functions on the asymptotic behavior of their 
coefficients. These methods, taken together, provide a powerful technique 
for obtaining the asymptotics of combinatorial sequences, and provide yet 
another justification, if one were needed, of the generating function ap- 
proach. 

In this section we will concentrate on functions whose only singularities 
are poles. 

Let f(z) be analytic in some region of the complex plane that includes 
the origin, with the exception of a finite number of singularities. If R is 
the smallest of the moduli of these singularities, then / is analytic in the 
disk \z\ < R , so this will be precisely the disk in which its power series 
expansion about the origin converges. 

Conversely, if the power series expansion of a certain generating func- 
tion / converges in the disk \z\ < R but in no larger disk centered at the 
origin, then there are one or more singularities of the function / on the 
circumference \z\ = R. 

A number of methods for dealing with questions of asymptotic growth 
of coefficient sequences rely on the following strategy: find a simple function 
g that has the same singularities that / has on the circle \z\ = R. Then 
f — g is analytic in some larger disk, of radius R' > A. say. 

Then, according to theorem 2.4.3 (q.v.), the power series coefficients 
of f — g will be < (-^7 + e) n for large n, and therefore they will be much 
smaller than t he coefficients of / itself. The latter, according to the same 
theorem 2.4.3. will infinitely often be as large as (4 — e) n . 


Therefore we will be able to find the most important aspects of the 
growth of the coefficients of / by looking at the growth of the coefficients 
of g. 

The strategy that wins, therefore, is that of finding a simple function 
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that- mimics the singularities of the function that one is interested in, and 
then of using the growth of the coefficients of the simple function for the 
estimate. 

These considerations come through most clearly in the case of a mero- 
morphic function f(z), i.e., one that is analytic in the region with the 
exception of a finite number of poles, and so we will study such functions 
first. The idea is that near a pole zq, a meromorphic function is well ap- 
proximated by the principal part of its Laurent expansion, i.e., by the finite 
number of terms of the series that contain (z — Zq) raised to negative powers. 


Example 1. 

The function f(z) = e 2 /(z — 1) is meromorphic in the whole finite 
plane. Its only singularity is at zo = 1, and the principal part at t- hat- 
singularity is e/(z — 1). Hence the function f(z) — e/(z — 1) is analytic in 
the whole plane. Thus if {c n } are the power series coefficients of / about- 
t-he origin, and { d n } are the same for the function e/(z — 1), the difference 
c n — d n is small when n is large. In fact, theorem 2.4.3 guarantees that for 
every e > 0 we have \c n — d n \ < e n for all large enough n. Therefore the 
‘unknown’ coefficients of f(z) are very well approximated by those of the 
simple function e/{z — 1). 

It is easy to work out- this particular example completely to see just 
ho w the machine works. The expansion of e z /(T — 1) about the origin is 
(see 


Rule 5 of section 2.2 


T — — = - ^{1 + 1 + 1 / 2 ! + ■ ■ ■ + l/n\}z n . 


On the other hand, the expansion of e/(z — 1) is 

£ 2 3 

= — e — ez — ez — ez — • • • . 

(z- 1 ) 

Hence the act of replacing the function by its principal part yields in one 
step the approximation of the true coefficient of z n , which is the nth partial 
sum of the power series for — e, by — e itself, which is a smashingly good 
approximation indeed. The reason for the great- success in this case is that 
merely subtracting off one principal part- from the function expands its disk 
of analyticity from radius =1 t-o radius = oo. 

Here’s another way to look at this example, without, mentioning any 
of the heavy machinery. Consider the innocent fact that 


z — 1 z — 1 z — 1 

In the first term, the power series coefficients are all equal to — e. The 
second term has no singularities at- all in the finite plane, i.e., it is an entire 
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function of z. By theorem 2.4.3, its coefficients are 0(e n ) for every positive 
e. Therefore the coefficients of f(z) are 


— e -f 0(e n ) (n — > oo) 


for every e > 0. | 

In less favorable cases one may have to subtract off several principal 
parts in order to increase the size of the disk of analyticity at all (i.e. , if 
there are several poles on the circumference) , and even then it may increase 
only a little bit if there are other singularities on a slightly larger circle. 

If / is meromorphic in 3?, let zq be a pole of / of order r, 1 < r < oo. 
Then in some punctured disk centered at zq, f has an expansion 

r oo 

/(*) = J2 ( , a Sl )j +J2 a ^ z ~ z °y • t 5 - 2 - 1 ) 

The first one of the two sums above, the one containing the negative powers 
of (z— zo), is called the principal part of the expansion of / around the singu- 
larity z o, and we will denote it by PP(f ; zq). The function / — PP(/; zq) is 
analytic at zq- That is to say, we can remove the singularity by subtracting 
off the principal part. 

If, besides zq, there are other poles of / on the same circle \z\ = R = 
\zo |, then let z±, ... , z s be all such poles. The function 

h(z) = f(z) - PP(f ; z 0 ) - PP(f; z x ) PP(/; Zs) (5.2.2) 


is regular (analytic) at every one of the points {zj} q. But / had no other 
singularities on that circle, so h is analytic in a circle centered at the origin 
that has radius R ' , where R' > R. 

That means, by | theorem 2.4.3 l again, that the power series coefficients 
of h, about the origin, cannot grow faster than 


lV +e 


for all large n. Thus, if f PZ {°n } 5 an d if 


g(z) = PP(f ; zo) + PP(/; *!) + ■■■ + PP(/; Zs) Z! {bn}: (5.2.3) 


then 




b n + O 




oo). 


We may then be well on our way towards finding the asymptotic behavior 
of the coefficients of /. 
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Indeed, let us now study the power series coeffi cients, a 


of the sum of the principal parts that are shown in (5.2.3). 
is a pole of multiplicity r, 


lout the origin, 
We have, if z 0 


PP(f-,z o) 


u -j 

h( z ~ z oV 


E 




Zoi 1 - ( z/z 0 )) j 


y^ y 

j= 1 z 0 n > 0 


n + j — 1 


n 


(-/-o) r 


E** E 


n>0 ''j=l ~0 


(- 1 ) J ~o-j fn + j — 1 

n+j \ j-1 


(5.2.4) 


We see, therefore, that a pole of order r at £o, 
tributes 


(~1 Y a -j 

2-^t n+j 

3=1 Z 0 


nzj 


of a function /, con- 


(5.2.5) 


to the coefficient of z n in /. 

The basic theorem, which asserts that we can well approximate the 
coefficients of a meromorphic function by the coefficients of the principal 
parts at its poles of smallest modulus, can be stated as follows: 

Theorem 5.2.1. Let f be analytic in a region T containing the origin, 
except for finitely many poles. Let R > 0 be the modulus of the pole(s) 
of smallest modulus, and let zq, , z s be all of the poles of f(z) whose 
modulus is R. Further, let R' > R be the modulus of the pole(s) of next- 
smallest modulus of f, and let e > 0 be given. Then 


M/M = M 


E pp d ;m} + o 

3=0 J 


R 1 




(5.2.6) 


Proof. By theorem 2.4.3. this theorem will be proved as soon as we estab- 


lish that if we subtract from f(z) the sum of all of its principal parts from 
singularities on the circle \z\ = R, then the resulting function is analytic in 
the larger disk \z\ < R' . Consider the moment when we subtract PP(f: Zq) 
from f(z). Certainly the resulting function, (j, say, is analytic at Zq. Next, 
however, we subtract PP(f;z \ ) from g instead of subtracting PP(g: Z \ ) 
from g. We claim that this doesn’t matter, i.e., that 


pp(f - pp(f-,z 0 y, Zl ) = pp(f-, Zl ). 
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To see this, observe that 

PP(f ~ PP(f; zo); Zl ) = PP(f ; z x ) - PP(PP(f ; z 0 ); * 1 ). 

But the second term on the right vanishes because PP{f ; zo) is analytic at 
zi. 

By induction on s, the result follows. | 


Example 1. Ordered Bell numbers 

We now investigate the asymptotic behavior of the ‘ordered Bell num- 
bers.’ These are defined as follows: a set of n elements has {^} partitions 
into k classes. If we now regard the order of the classes as important, but 
not the order of the elements within the classes, then we see that [n] has 
&!{/} ordered partitions into k classes. The ordered Bell number b(n) is 
the total number of ordered partitions of [n], i.e. , it is 

Our question concerns the growth of { b(n)} when n is large. To find a 
nice formula for these numbers, multiply both sides of the identity (4.2.16) 
by e~ y and integrate from 0 to oo. This gives the neat result that 

*») = E£ t- ( 5 ' 2 - S > 

r>0 Z 


Then the exponential generating function of the ordered Bell numbers is* 


/« = E 

n> 0 


b(n) , 

n\ 


1 

2 - e 2 ' 


(5.2.7) 


We’re in luck! The generating function f(z ) has only simple poles, 
namely at the points log 2 ± 2km for all integer k. The principal part at 
the pole Zo = log 2 is (—1/2 )/(z — log 2). That principal part all by itself 
contributes 

1 

2 (log 2) n+1 

to the coefficient of z n . There are no other singularities of f(z ) on the circle 
of radius log 2 centered at the origin. Hence 


h(z) = f(z) 


(- 1 / 2 ) 

(z ~ log 2) 


is analytic in the larger circle that extends from the origin to log 2 + 2tx i. 
The radius of that circle is 


p — \J (log 2) 2 + An 2 = 6.321 


* Be sure to work this out for yourself. 
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Hence the coefficients of h(z) are 0((.16) n ). Altogether, we have shown 
that the ordered Bell numbers b(n ) are of the form 

H") = 2(log2)-W + Oft 16 )"" 1 ), (5.2.8) 

which is not bad for so little effort invested. More terms of the asymptotic 
expansion can be produced as desired from the principal parts of f(z) at 
its remaining poles, taken in nondescending order of their absolute values. 
The reader should look into the contribution of the next two poles together, 
which are complex conjugates of each other. 

Below we show a table of some values of n, b(n), and n!/(2(log2) n+1 ). 


n 

1 

2 

3 

5 

10 

b(n) 

1 

3 

13 

541 

102247563 

n!/( 2(log2) n+1 ) 

1.04 

3.002 

12.997 

541.002 

102247563 


The agreement is astonishingly close. Basically all we have done is to 
use the Taylor coefficients of the series for 1 / (2(log 2 — z)) as approximations 
to the coefficients of the series for 1/(2 — e z ). Yet we are rewarded with a 
superb approximation. | 


Example 2. Permutations with no small cycles 

Fix a positive integer q. Let f(n. q ) be the number of permutations 
of n letters whose cycles all have lengths > q. We want the asymptotic 
behavior of f(n,q). 

By exercise 11 of chapter 3,1 the egf of {/(n, g)}o° is 


f q (z) = exp ^ 


n>q 


Z 

n 


exp { log S 

l<n<g 


Z' b 

n 


-{z+...+zy q } 


1 


(5.2.9) 


The only singularity of f q (z) in the finite plane is a pole of order 1 at 
z = 1 with principal part e~ Hq /(I — z), where 


11 1 
H a — 1 + — + — + • ■ ■ H — 


is the gth harmonic number. 


5.3 Analyticity and asymptotics (II): Algebraic singularities 
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This is the kind of situation where we get very accurate asymptotic 
estimates, because the difference between the function and its principal part 
at z = 1 is 

K z ) = fq( z ) ~ f— J 

X £ 

e -{z-\ \~z q /q} _ e -H q 


and is analytic in the whole plane, i.e. , is an entire function. Again, by 
I theorem 2.4.31 the nth coefficient of h{z) is 0(e n ) as n — > oo for every 
e > 0. Therefore, 


f(n,q) 


= e 




n\ 


q + 0(e n ) (n — > oo). 


(5.2.10) 


The strikingly small error term in this estimate suggests that for each 
fixed q the probability that an n-permutation has no cycles of length < q 
should be very nearly independent of n. Consider the case q = 1 to get some 
of the flavor of what is going on here. Then /(n, l)/n! is the pro bability 


that an n-permutation has no fixed point. But we saw in (4.2.10) 


that 


/(». l)/»! = e~ n ' = 1 - 1 + 1/2 
= e -1 + 0(l/n!). 


+ (-l) n /n! 


Indeed, the probability is very nearly independent of n, and the error in- 
volved in using the principal part is 0(e n ) for every positive e. || 

The two examples above have shown the method at work in situations 
where it was atypically accurate. More commonly one finds not just one 
pole of order 1 in the entire plane, but many poles of various multiplicities. 
The method remains the same in such cases, but a lot more work may be 
necessary in order to get estimates of reasonable accuracy. An example 
that shows this kind of phenomenon was worked out in section 3.15, in 
connecti on with the mone y-changing problem. In fact, the proof of Schur’s 
theorem 


Theorem 3.15.2 


was an exercise in the use of principal parts at the 
poles of a meromorpliic function. The importance of the single dominant 
singularity was, in that case, much less, though it was enough to get the 
theorem proved! 


5.3 Analyticity and asymptotics (II): Algebraic singularities 

Again, let f(z) be analytic in some region that contains the origin, but 
now suppose that the singularity zq of / that is nearest to 0 is not a pole, 
but is an algebraic singularity ( branch point). What that means is that 
f(z ) = (zo — z) a g(z), where g is analytic at zo and a is not an integer, but 
is a real number. 
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A case in point was given in (3.9.1) where we found that the egf for 
the numbers of graphs of n vertices whose vertex-degrees are all equal to 2 
is 

e -z/2-z 2 /4, 


m = 


VT 


(5.3.1) 


In this section we will derive the theorem (‘Darboux’s lemma’) that 
allows us to deduce the asymptotics of sequences with this kind of a gener- 
ating function. What it all boils down to is that one should do exactly the 
same thing in this case as in the case of meromorphic functions, and the 
right answer will fall out. The proof that this is indeed valid, however, is 
more demanding in the present case. We follow the proof in [KnW] 

By considering f(zzo) instead of f(z), if necessary, we see that we can 
assume without loss of generality that zq = 1. Hence we are dealing with a 
function / that is analytic in the unit disk, and which has a branch point 
at z = 1. We will also assume, until further notice, that zo = 1 is the only 
singularity that / has in some disk \z\ < 1 + 77, where r/ > 0. 

After the lessons of the previous section on meromorphic functions, 
here’s how we might proceed in this case. First we have f(z) = (1 — z) a g(z), 
where g is analytic at z = 1. That being the case, we can expand g in a 
power series 

9(z) = ^9k{l-z) k 

k> 0 

that converges in a neighborhood of z — 1. Hence / itself has an expansion 

/(*) = £>( l-~)* +a - (5-3.2) 

fc> 0 


By analogy with the procedure for meromorphic functions, we might 
expect that each successive term in the above series expansion generates 
the next term of the asymptotic expansion of the coefficients of /. That is 
in fact true. The dominant behavior of the coefficient of z n in f(z) comes 
from the first term in (5.3.2). That is, the simple function < 70(1 — z ) a has, 
for its coefficient of z n , the main contribution to that coefficient of /, etc. 

We will now prove all of these things. 


Lemma 5.3.1. Let {a„}, {b n } be two sequences that satisfy (a) a n = 
0(n~z) and (b) b n = 0(9 n ) (0 < 6 < 1). Then 

^ ^akbn-k = 0(n -7 ). 
k 

Proof. We have first (the 0’s are not all the same constant) 


0<k<n/2 


,— k 


< < max lofc 

[0<k<n/2 


2^ C9 7 

0<k<n/2 


< ma x{0, Cn~^}{C9 n / 2 } 

< C9 n (0 <9 < 1). 
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Further, 


^ ^ &kbn—k 

n/2<fc<n 


< 


max 

n/2<fc<n 



J2 o n ~ k 

n / 2<k<n 


< Cn ~ 7 


Lemma 5.3.2. If (3 ^ {0, 1,2,.. then 


[*”](! -zf 


n 


-0-i 


n-py 


Proof. We have 




n — f3 — 1 

n 

= r(n-/?) 

F(-/3)r(n+ 1 )’ 

and the result follows from Stirling’s formula, which is 


— J v27 rn (n — > oo). 


(5.3.3) 


Lemma 5.3.3. Let tt(z) = (1 — z) 7 u(z), where v{z) is analytic in some 
disk \z\ < 1 + r), (p > 0). Then 


[z n \u(z) = 0(n - 7 - 1 ). 


(5.3.4) 


Proof. Apply lemma 5.3.1 with o n = [z n ]( 1 — 


v is analytic in a disk 
by lemma 5.3.2. | 


z\ < 1 + r), we have b r 


z) 7 and b n = [. z n ]v(z ). Since 
= O(0 n ). The result follows 


Theorem 5.3.1. (Darboux) Let v(z) be analytic in some disk \z\ < 1 + p, 
and suppose that in a neighborhood of z = 1 it has the expansion v(z) = 
J2 v j( 1 — z) J ■ Let (3 ^ { 0,1,2,...}. Then 


[*”] {(1 - zfv(z)} = [*“] ( - zf+> } + 

3= 0 


= £*>. 
3=0 


n- (3- j - 1 
n 


+ 0(n- m - f3 ~ 2 ). 


(5.3.5) 
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Proof. We have 


(1 - zfv(z) - 5>j(l - zf+i = £ i>,(l - zf + ‘ 

.7=0 j>m 

= (1 - z) p+m+1 v(z), 

where the r egions of analy ticity of v and of v are the same. The result now 
follows front lemma 5.3.3. ■ 


Example 1. 2-regular graphs 


of the number 


For the exponential generating function f(z). in (5.3.1) 
of 2-regular graphs of n vertices, we have f(z) = (1 — z) >3 v(z) with /3 = —1/2 
and v(z) = exp{— z/2 — z 2 / 4}. The first few terms of the expansion of v(z) 
about z = 1 are 


,-z/2-z’/4 = e -3/4 + g— 3/4,2 _ z) + I e -3/4(l - z f + . . . 


Then according to 


(5.3.5) this expansion of v(z) around z = 1 ‘lifts’ 


to an asymptotic formula for the coefficients of f(z), which are in this case 


7( n)/n!, w. 


use 


(5.3.5) 


tere 7 (n) is the number of 2-regular graphs of n vertices. If we 
with m — 2, we obtain 


7(«) _ —3/4 f n ~ l/2\ 3/4 f n ~ 3/2\ 1 3/4 (n- 5/2^ 

n\ V n + \ n r 4 l n t 


If we like, we can further simplify the answer by using the known 
asymptotic expansion of the binomial coefficient 


fn — a — 1\ n a 1 
V n ) ~ r (-«) 


cr(ct -)- 1) ci(ci T l)(ci T 2)(3ct T 1) 
+ 2n + 2T/2 


If this be substituted into (5.3.6), the result is 


+ ■ ■ ■ . 

(5.3.7) 


7(n) « 


n\e 3//4 
y/mr 



1 

128n 2 



(5.3.8) 


The form of Darboux’s method that we have proved applies when there 
is just one algebraic singularity on the circle of convergence. The method 
can be extended to several such singularities. We quote without proof a 
more general result of this kind ( [Sz] , thin. 8.4): 
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Theorem 5.3.2. (Szego) Let h(w ) be analytic in |u;| < 1 and suppose it 
has a finite number of singularities {e l ^ k }\ on |w| = 1. Suppose that in the 
neighborhood of each singularity e l ^ k there is an expansion 

h(w) = ^^ fc) (l - we-^ k ) ak+ul3k , 

u>0 


where /3k > 0. Then the following is a complete asymptotic series for the 
coefficients of h(w): 


w n ]h(w ) 


v>0 k= 1 ' 


5.4 Analyticity and asymptotics (III): Hayman’s method 

In the previous two sections we have seen how to handle the asymp- 
totics of sequences whose generating functions have singularities in the finite 
plane. Essentially, one looks for the singularity(ies) nearest the origin, finds 
simple functions whose behavior near the singularities is the same as that 
of the generating function in question, and then proves that the asymptotic 
behavior of the coefficients of that generating function is the same as that 
of the coefficients of the simple functions that behave the same way near 
the singularities. 

But what shall we do if the generating function doesn’t have any sin- 
gularities, i.e., if it is an entire function! 


Example 1. The coefficients of e z 

Consider the function e z . The coefficient of z n in e z is 1/n!. Can we 
think of some fairly general method for handling the asymptotics of entire 
functions, which in this case will derive Stirling’s formula for us? 

Here’s how we might begin. By Cauchy’s formula we have 

1 If eZ ^ z 

n\ 2i ri J z n+1 ’ 


where the contour of integration is some simple closed curve that encloses 
the origin. If we use for the contour a circle of radius r centered at the 
origin, then by taking absolute values we find that 


1 1 
— < — max 

n\ 271 \z\=r 



* 


(27t r) 


Since e z is an entire function the value of r > 0 is entirely up to us, 
so we might as well choose it to minimize the upper bound that we will 
obtain. But 

e r 

min — (5.4.1) 

r>0 r n 
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is attained at r = n, so the best possible estimate that we can get from this 
argument is that 

1 6 

— < (-) n - 
n\ n 

If we compare this with Stirling’s formula 


1 


1 


n\ y/2nir n 


(r) n > 


[Ha 

with 


we see that we haven’t done too badly, since our rather crude estimate 
differs from the ‘truth’ by only a factor of about l/-\/2mr. 

To do better than this we are going to have to treat the variation of 
e z around the contour of integration with a little more respect, and not 
just replace it by the maximum absolute value that it attains. Indeed, for 
most of the way around the circumference \z\ = r, the absolute value of e z 
is considerably smaller than e r . It is only in a small neighborhood of the 
point z — r that it is nearly that large. 

n his 1956 paper A generalisation of Stirling’s formula, W. K. Hayman 

developed machinery of considerable power for dealing more precisely 
this kind of situation. Further, his method is uncommonly useful for 
generating functions that arise in combinatorial theory, because these tend 
to have nonnegative real coefficients. For that reason, on any circle centered 
at the origin, such a function will be largest in modulus at the positive real 
point on that circle. Hayman’s method is strongest on just such functions. 

Hayman’s machinery applies not only to entire functions, but to all 
analytic functions, even those with singularities in the finite plane. In 
practice it has most often been used on entire functions, mainly because, 
as we have seen, other methods are available when singularities exist in the 
finite plane. 

Let f(z) be analytic in a disk \z\ < R in the complex plane, where 
0 < R < oo. Suppose further that f(z) is an admissible function for the 
method. Operationally, that simply means that f(z) is a function on which 
the method works. We will give some sufficient conditions for admissibility 
below. 

Define 

M(r) = max{|/(z)|}. (5.4.2) 


It will be a consequence of the admissibility conditions that 


M(r ) = /(r) (5.4.3) 

for all large enough r. This is because, as we remarked above, the method 
is aimed at functions that take their largest values in the direction of the 
positive real axis. 
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Next define two auxiliary functions, 

fir) 


a{r ) = r 


fir) 


and 


ft ( fff ( 

b(r) = ra'(r) = r — — ^ - + r 2 

f\ r ) f{r) 

The main result is the following: 


f'( r ) 

f(r) 


(5.4.4) 


(5.4.5) 


Theorem 5.4.1. (Hayman) Let f(z) = ^ a n z n be an admissible function. 
Let r n be the positive real root of the equation a(r n ) = n, for each n = 
1,2,.. ., where a(r) is given by eq. (5.4.4) above. Then 


f(r n ) 

r nV 27T b(r n ) 


as n 


Too, 


(5.4.6) 


where b(r ) is given by (5.4.5) above. 

It will be noted that the recipe itself is quite straightforward to apply. 
What is often difficult is determining whether the function f(z ) is admis- 
sible for the method or not. Before we explain that notion, let’s apply the 
theorem to f(z) = e z . taking on faith, for now, the fact that it is admissible. 


Example 1. (continued) The function e z 

First we would calculate a(r) = r, in this case, from (5.4.4). Then the 
equation o(r n ) = n, which determines {r n }, becomes just r n = n. These 
numbers r n are the same as those that we found earlier from the condition 


(5.4.1). They are simply the values of r at which the minimum of f(r)/r r 


occurs. 

Next we find b{r) = r from (5.4.5), and Hayman’s result (5.4.6) reads 


as 


1 


n\ 


u n v / 2mr 


which is Stirling’s formula again, but this time in its exact form. 


Next let’s give a precise definition of the class of admissible functions. 
Let f(z) = ^2 n>0 a n.z n be regular in \z\ < R , where 0 < R < oo. Suppose 
that 

(a) there exists an R 0 < R such that 

/(r) >0 (Rq <r < R), 

and 

(b) there exists a function S(r) defined for Rq < r < R such that 
0 < 8{r) <7 r for those r, and such that as r — R uniformly for 
\0\ < 8(r ), we have 

f(re ie ) ~ f( r )e ida{r) -^ 2b(r \ 
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and 


(c) uniformly for 8(r) < |0| < it we have 


f(re ld ) 


°(/( r )) 


R), 


and 


44 


as r 


R we have b(r) — > +oo, where o(r), &(r) are defined by 
(5.4.4), (5.4.5), Then we will say that f(z) is admissible , and the apparatus 
oT theorem 5.4.1~| above is available for determining the asymptotic growth 
of the coefficients {o n }. 

However, it isn’t always necessary to appeal directly to the definition 
of admissibility in order to be sure that a certain function is admissible. 
Here are some theorems that give sufficient conditions for admissibility, 
conditions that are much easier to verify than the formal definition above. 
If f(z ) is admissible, then so is eR z \ 

If / and g are admissible in \z\ < R, then so is fg. 

Let / be admissible in \z\ < R. Let P be a polynomial with 
real coefficients which satisfies P(R) > 0, if R < oo, and which 
has a positive highest coefficient, if R = +oo. Then f(z)P(z ) is 
admissible in \z\ < R. 

Let P be a polynomial with real coefficients, and let / be admis- 
sible in \z\ < R. Then f + P is admissible, and if the highest 
coefficient of P is positive, then P[f(z)\ is also admissible. 

Let P(z) be a nonconstant polynomial with real coefficients, and 
let f(z) = e p ^ z \ If [ z n ]f(z ) > 0 for all sufficiently large n, then 
f(z ) is admissible in the plane. 


(A) 

(B) 

(C) 


(D) 


(E) 


Example 2. 

Let t n be the number of involutions of n letters, i.e., the number of 
permutations of n letters whose cycles have lengths < 2. We will find the 
asympt otic beh avior of {t n }. 

By (3.8.3) the egf of the sequence {t n } is 


m = £ 


n n 


D z+\z 2 


n> 0 


n\ 


By criterion (E) above, f(z ) is clearly Hayman admissible in the whole 
plane. Hence Theorem 5.4.1 | applies. To use it, we first calculate the func- 
tions a(r), b(r) of |(5.4.4) and (5.4. 5)| We find that 


a(r) = r 


f'(r) 

f( r ) 


r + r 


and 


b(r) = ra (r ) = r + 2 r 2 . 
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Next we let r n be the positive real solution of the equation a(r n ) = n, which 
in this case is the equation 


r„ + rl = n. 


(5.4.7) 


Evidently 


r n = 


= \/n 


n+ i“i 


1 ) 2 1 

1 + fcf "2 


= vn 


1 + ^ 


1 


128n 2 


1 1 

U ~ 2 + 8^ 


+ 


1 


128n 3 / 2 


1 

~ 2 
H j 


(5.4.8) 


(by (2.5.6) 


so we have a very good fix on where r n is, in this case. 



Let’s take them one at a time. 

First, 

f(r n ) = e r " + ^ = e ^ r " +n) = e n/2 e r ™ /2 , 
where (5.4.7) was used again. But in view of (5.4.8), 

e r " /2 =expj^-i + 0( re - 1 / 2 )| 

~ e 5x/ " _I (n — ■> oo) 

and so 

f(r n ) ~ exp 1 1 1 1 (n-K»). (5.4.9) 

So far, so good. Next on the list is 6(r n ), and that one is easy since 


b(r n ) = r n + 2r 2 ~ 2r 2 ~ 2 n 
The last one is the hardest, and it is 


(n — > oo). 




= n '■ 


1 - 


1 


2y / n 8n 


1 

+ 


(5.4.10) 


(5.4.11) 
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What we want to do now is to find just one term of the asymptotic behavior 
of the large curly brace to the nth power, and of course, it’s that nth power 
that causes the difficulty. 

To illustrate the method in a simpler context, consider (1 + — ) n . What 
does this behave like for large n? Does it approach 1? We know that it 
doesn’t; in fact it approaches e. So the correct asymptotic relation is 


1 + 


1 


n 


n 


oo). 


Hence, although 1 + ^ ~ 1, (1 + ^) n ~ e. In general, one cannot raise 
both sides of an asym ptotic equality to the nth power and expect it still to 
be true. In exercise 7 below there are a number of situations of this kind to 


think about. 

To take a slightly harder example, how would we deal with 


1 

1 H f= 

n 


(5.4.12) 


What does it behave like when n is large? The way to deal with all of these 
questions is first to replace (1 + • • -) n by exp {n log (1 + ■ ■ •)}. Next, the 
logarithm should be expanded by using the power series (2.5.2), to get 

( 1-1 ) n = exp {n log (H )} 

= exp — (...) 2 /2 + (... ) 3 /3 -•-}). 


The infinite series in the argument of the exponential must now be broken 
off at exactly the right place. Terms can be ignored beginning with the first 
one which, when multiplied by n, still approaches 0. More briefly, we can 
ignore all terms of that infinite series which are o(n -1 ). 

In the example (5.4.12) we have 


1 


1 H — -j= = exp < n log 1 H — -j= 


1 


exp {" (iTi? “ ^ + ° (re_3/2) ) } 


~ exp <^7T 
= exp < \jn - 


-= - — 1 

n 2 n J 

1 


Now that we have that subject under our belts, we can return to the 
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real problem, which is (5.4.11) 
1 


We now find that 


1 - 


1 

+ 


2y / n 8 n 

= exp < n log ( 1 


1 


= exp < n 


1 

2 y/n 8 n 

+ TV1 


2 i/n 8 n ) 


exp (— y/n/2). 


Hence, from (5.4.11) 


_ 2^ + s) +°("" 3/2 ))} 


< ~n n/2 exp(- v / n/2). 


(5.4.13) 


That finishes the estimation of the three quantiti es that are neede d by 
Hayman’s t heorem. The result, obtained by putting (5.4.9), (5.4.10) and 


(5.4.13) into (5.4.6) 


is that 




n\ 


r\j 



2n 2 y/mr 


Finally, if we multiply by n\ and use Stirling’s formula, we obtain, for 
the number of involutions of n letters, 


t n ~ 


-^n n/2 ex p 
V2 



+ \fn 



(5.4.14) 
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Exercises 

1. Use the LIF to show that the (infinite) binomial coefficient sum 


« = £ 


sL + 1\ A 


— sL — l 


) ( S L + 1) 


for A > 1 and integer L > 0, satisfies + 1 = 0. 

2. The Legendre polynomials { P n (x ) } are generated by 


1 

\[\ — 2 xt + t 2 


£^«W( n . 

n> 0 


Let x be a hxed complex number that lies outside the real interval [—1,1], 
and let r denote that one of the two roots of the equation r 2 — 2 xt + 1 = 0 
which is > 1 in absolute value. Use the method of Darboux to show that, 

as n — > oo, 


Pn(x ) ~ 


T n + 1 

^nn(r 2 — 1 ) 


3. If u = u(t ) satisfies u = t(p(u ) and n > 0, show that 


[u n mu)r = [t r 


tu' (t ) 
u(t) 


= [t n ] 


(1 -t<f/(u(t))) 


4. Define, for all n > 0, 7 n = [x n ](l + x + x 2 ) n . 

(a) Use the result of exercise 3 above to prove that for n > 0, 

7 " = M ( vi-2l-3xJ ' 


(b) Show that, using the notation of problem 2 above, 

7 n = (V3/i) P n (i/V 3 ), 

and so obtain the asymptotic behavior of the sequence {7 n } for 
large n. 

5. Dehne, for integer p > 3, 



(n > 0). 
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(a) Exhibit S p (n ) as [x n ] in a certain ordinary power series, which 
(alas!) itself depends on n. 

(b) Nevertheless, use the LIF (backwards) to show that 

TS„(n)x"( l + xT ”- 1 =- — — — . 

V (1 -x)(l - (p- l)x) 


(c) Deduce from part (b) that the {S'p(n)} satisfy the recurrence 


Em)* ( pn - (p k - i)k )s r i.n- k ) = ^ p i >: + ;~ 1 (n > o). 


(d) If F(u) = En >0 S p (n)u n , let 


x = 


iP~ 1) 


in part (b) to show that 


(P-1 ) P_1 j\ (P-1) 3 ,2 


p P 

as e — > 0. 

(e) If 

then show that 


2p 


€ + 


P 


ip- 1)0 — 2)e 


+ 0 ( 1 ) 


7r» _ IIP- 1 

p(:c) = F f — x 


pp 




(p - 2 ) V W \/i 


a; 


(f) Use Darboux’s method to show that, as n — > oo, 


S p (n) 


1 , /© 


(p — 2) y U7T \{p — i) p 1 


pp 


(g) From part (b) show that 
"4u 2 x n 


E S 3( 

n>0 


n 


27 


u 


2 u 


u — 2sin(|sin 1 it) 


2u — 3 sin ( i sin 1 it) 
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6. Under what additional conditions on a polynomial P with nonnegat-ive 
real coefficients will there exist an N such that for all n > N we have 

[; z n ]e p <*) > 0? 

7. Find the asymptotic behavior (main term) of (1 + e n ) n if 

(a) e„ = n“ (0 < a < 1), 

(b) e n = n~ a (0 < a < 1), 

(c) e n — n~ a logn (1 < a < 2). 

8. The purpose of this problem is to find the asymptotic behavior of the 
number a n of permutations of n letters whose cycles are all of lengths < 3, 
by using Hayman’s method and the Lagrange Inversion Formula. (The use 
of a symbolic manipulation package on a computer is recommended for this 
problem, in order to help out with some fairly tedious calculations with 
power series that will be necessary).) The egf of {a n } is 

f( z ) = ex.p{z+^ + ^}. 


(a) Show that / is admissible in the plane. 

(b) Because r n in this case satisfies a cubic equation rather than a 
quadratic, as in the example in the text, we will use the LIF to 
find the root and its powers with sufficient precision. Show that 
if we write 

u = l/r n ; t = rU 1 / 3 ; 4>(u) = (1 + u + u 2 ) 1 ^ 3 , 

then u satisfies the equation u = t(j)(u ), which is in the form 
(5.1.1). 

(c) Use the LIF to show that the root r n has the asymptotic expansion 

1 1 11 H81 _ 5/3 . 

r„-^J3 + 3^ + 3n + 8l^ + ° (n >■ 


(d) Explain why the number of terms that were retained in part (c) is 
the minimum number that can be retained and still get the first 
term of the asymptotic expansion of a n with this method. 


(e) Show that- 


n 


exp 




(f) Show that- 


b(r n ) ~ 3n. 
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(g) Show that 


/On) ~ exp <1 + in 2 / 3 + jjn 1 / 3 - ^ 


(h) Combine the results of (d), (e), (f) to show that the number of 
permutations of n letters that have no cycles of lengths > 3 is 


2 71 

n a 

a n ~ exp 




2 n 1 o/o 5 i /o 29 

— — + -n 2/3 + -n 1/3 
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9. Derive the power series expansion (2.5.16) 


10. In this exercise, a(n,k ) is the number of involutions of n letters that 
have exactly k cycles, and t n = a(n, k ) is the number of involutions of 
n letters. 


(a) Show that 


E 

n.k 


c(n, k ) fc _ 1^ 2 ) 


n: 


-x y = e a 


(b) Hence find the formula 

cr(n, k) = 


n\ 


(n — k)\(2k — n)\2 n k 


for <j(n, k). 

(c) Using the results of part (a) and problem 5 of chapter 3, show 
that the average number of cycles in an involution of n letters is 
exactly 




tn— 1 


(d) Using (5.4.14) 


show that the average number of cycles in an in- 


volution of n letters is 


71 I 

= 2 + 2 ^( 1 + °( 1 )) (n^oo). 


Appendix 

Using Maple* ** and Mathematical 

Many branches of mathematics that were formerly thought of as being fit 
only for humans, are being invaded by computers. First, elementary school 
students learned how to multiply numbers with many digits and then found 
out that little calculators could do it for them. Other kinds of mathematics 
that are taught in secondary schools that now can be done by computers 
include expanding and factoring algebraic expressions, solving linear and 
quadratic equations, plotting graphs of curves and surfaces, doing logarithms 
and powers, and more. 

At the university level we find now that “computer algebra” programs 
can differentiate functions symbolically, do integrals, vector analysis, linear 
algebra, etc., all symbolically, rather than numerically. 

Here we want to show how computers can easily handle much of the 
routine work that is involved in solving problems about generating functions. 
To emphasize this point, we will show how well computers can do some of 
the homework problems in this book! Very well indeed, we’re sure you will 
agree. 

In this brief Appendix we’ll discuss first how computer programs can 
do extensive manipulations of power series. Next we’ll focus on one such 
program, Mathematica™ (Version 2.0) , and tell you about its amazing built- 
in RSolve function. Finally we will look at how Maple™ handles asymptotics, 
which can be quite a boon for problems such as those we looked at in the 
previous chapter. 


1. Series manipulation 

In Mathematica™, the instruction Series [f ,x,x0,m] will display the 
first m + 1 terms of the power series expansion of / about x = xO. Thus, 
to see the first 10 terms of the series for sinx/(l + x), about the origin, you 
would enter (the Maple™ instruction that would accomplish the same thing 
would be series (sin(x) / (1+x) ,x=0,9) ) 

Series [Sin[x] /(1+x) , {x,0,9}] 

and Mathematica™ would respond 


x — 


7 x 1 * 3 * t 47 x 5 
IT + 40 


5923 a: 7 426457 a: 9 _ uo 

~ 5040 ^ 362880 ^ ° ^ 


Perhaps you’d like to 


series 


then enter 


(2.5.10) of Chapter 2, 


reck the accuracy of the terms displayed in the 
and to see what the next two terms are. If so, 


* Maple is a registered trademark of Waterloo Maple Software. 

** Mathematica is a registered trademark of Wolfram Research, Inc. 
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and you will see 


Series [(1-Sqrt [l-4x] ) /(2x) , {x,0,9}] 


1 + x + 2 x 2 + 5 x 3 + 14 x 4 + 42 x 5 + 132 x 6 + 429 x 7 + 1430 x 8 
+ 4862 x 9 + 16796x 10 + 58786 a; 11 + 0(a;) 12 . 


If you want to obtain the list of coefficients of the terms of this series, because 
they are the numbers that the series “generates,” then ask for 

Coeff icientList [*/ 0 ,x] 

t-o obtain (the “%” means the result of the computation in the preceding 
line) 

{1, 1, 2, 5, 14, 42, 132, 429, 1430, 4862, 16796, 58786} 

and there are the Catalan numbers on display. 

If you want to see only the coefficient of x 7 then you would enter 

Coefficient [°/ 0 ,x, 7] 

instead, and the 429 would appear. 

A little more work is needed to see sequences that are generated by expo- 
nential generating functions. Suppose you wanted the first- 12 Bell numbers. 
According to theorem 1.6.1 these are the coefficients of x n /n\ in 

Series [Exp [Exp [x] -1] , {x , 0 , 12}] . 

If you t-ype exactly that, Mathematica™ will reply with 


1 + x + x 2 + 

1007 a; 9 
+ 17280 


5 a; 3 5 a; 4 13 a; 5 203 a; 6 877 x 7 

+ ~8~ + 30 + 720 + 5040 
4639a; 10 ( 22619a; 11 ( 4213597a; 12 
+ 145152 + 1330560 + 479001600 


23 a; 8 

+ 994 


+ 0(a;) 13 , 


which isn’t quite what you wanted because, for instance, the coefficient of 
a; 8 /8! is not readily apparent. One more instruction, such as 

Table [j ! Coefficient ["/, ,x, j] , { j ,0, 12}] 

will get the desired display of Bell numbers, 


(1, 1, 2, 5, 15, 52, 203, 877, 4140, 21147, 115975, 678570, 4213597}. 


2. The RSolve.m routine 

The RSolve package was written in Mathematica™ by Marko Petkovsek 

[Pe]. Its purpose is to find symbolic solutions to recurrence relations and 

difference equations. It can do so by explicitly finding the ordinary power 

series or exponential generating function of the unknown sequence. 

To use it one first reads in the package with 


<<DiscreteMath/RSolve .m 
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One then has a powerful facility for finding generating function solutions to 
problems in combinatorial recurrence. 

Let’s try it on the Fibonacci recurrence, with the call 

RSolve [{f [n+2] ==f [n+1] +f [n] , f [0] ==0,f [1] ==l} ,f [n] ,n] . 

It replies, after an order to Simplify [%] , as follows. 


{{/(™) 




+ (] + #) 



If(n > 1, 1, 0) 

}}, 


which is, of course, the explicit formula for the Fibonacci numbers. If you’re 
ready for this, let’s change the call above by replacing “RSolve” by “Gener- 
atingFunction,” and adding one more argument, x say, to tell it the variable 
to use in the generating function. That means that we enter the request 

GeneratingFunction [{f [n+2] ==f [n+1] +f [n] ,f [0] ==0 ,f [1] ==l} ,f [n] ,n,x] . 

And what is the reply? It is 


{{ 


x 

1 — x — X 2 


}}, 


which even in an age of multitudinous computer miracles must leave us in 
awe. 

Perhaps you’d rather have the exponential generating function of your 
numbers. Well then you would change the call to 

Exponent ialGeneratingFunct ion [{f [n+2] ==f [n+1] +f [n] ,f [0] ==0 ,f [1] ==l} ,f [n] ,n,x] 
and the computer would inform you that 


{{ 


— e 


(l-v^P 


(l + \/5) x 

+ e 2 




}} 


is the function you seek. 


Now let’s watch it solve the recurrence (2.2.6) 


for the number of block 


fountains of coins that have k coins in the first- row. This time the call is 


GeneratingFunction [f [k] ==1+Sum [ (k-j ) f [j] ,{j ,l,k}]/ ;k>=l, f [k] ,k,t] , 

and the response is 

:-i+*) t 


«- 


:}} 


in agreement with (2.2.7) 


1-3 t + t 2 

It can even find a closed formula for the number 


of such fountains from the generating function. To get that, ask for 

Simplify [SeriesTerm ['/, , {t , 0 ,n|] ] 

and the output will be 


{{ 


(5- V5) 


3 , V5 
2 ' 2 


10 


+ 


3 y/5 

2 2 


(5 + a/5) 


10 


If(n > 0, 1, 0) 


If(n = 0,1,0)}}. 
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As you can see, it did the partial fraction expansion followed by two g eometric 
series manipulations, just as we did to obtain, for instance, (1.3.3) 

The package can also find closed form expressions for the sums of series 
in which formulas are given for the nth coefficient. A request 

PowerSum[a n+b , {z ,n}] 

will produce the answer to exercise 1(b) in this book, in the form 


b az 

1-2 + (~l + z) 2 ' 


It can do muc’ 
that we did in 


i harder ones than that, like the gf of the harmonic numbers 


Example 5 of chapter 2 


That one is the answer to the call 


PowerSum[Sum[l/j ,{j,l,n}] ,{x,n}] , 


namely 


log(l — x) 
1 — x 


The reader who takes the time to experiment with the capabilities of the 
RSolve.m package will be amply rewarded. 


3. Asymptotics in Maple ™ 

In Maple™, if you type asympt (f ,x,n) ; you will receive n terms of the 
asymptotic expansion of the function / of the variable x, as x — > oo. Let’s 
try Stirling’s formula first, by asking for 

asympt (n ! ,n, 5) ; 

The computer’s answer is (we use ‘PT instead of £ 7 t’ etc. because that’s 
pretty much how it will look on your screen) 


(V/w/v / 2 + 1/12 


2 1 /2p i l/2 

n 1 / 2 


+ 1/288 


2 1 /2p i l/2 

T7.3/ 2 


139 2 1 / 2 Pi 1 / 2 
51840 ^V2 


571 2 1 / 2 Pi 1 / 2 

2488320 tfT 2 



/((l /n) n exp{n)). 


We all know that (1 + l/n) n — e, but how fast does it go? The answer 
given by Maple™ is 




In closing, let’s do exercise 8(c) of the previous chapter, which asks for 
the asymptotic behavior of the nth power of 


1 

r n 


1 11 11 

n 1 / 3 3 n 2 / 3 "I" 3 n 


8 1 

8l/p73 


+ 0(n~ 5 / 3 ). 
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Needless to say, Maple™ is up to the task, and gives 

exp |in 2/3 + ^n 1/3 j(l + 0(1)). 


Exercises 


On any computer that is available to you, do the following. 
1. Exercises 


1, 2, 5, 6, 8 if Chapter 1. 


2. Check the first five terms of any five of the series displayed in section 2.5. 

3. Exercises 


1, 2, 4 


of chapter 2. 


4. Use the “series” command to find the first 15 values of g(n) of (3.9.1). 

5. From (3.8.3) tabulate the number of involutions of n letters, for n < 15. 

6. Use the asymptotics capability of Maple™ to find the first 5 terms of the 
asymptotic expansions of the following. 


(a) (l + l/v^) n 

(b) \fn\ 

(c) (1 + 1 /n)'S™ 

(d) sin(sinl/x) 
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Answers to problems for chapter 1 


(a) (xD)(l/(l — x)) = x/(l — x) 2 

(b) ( axD + /3)( 1/(1 — x)) = ax /(l — x ) 2 + /5/ ( 1 — x) 

(c) (xD) 2 ( 1/(1-*)) 

(d) (a(xD) 2 + (3xD + 7)(1/(1 — x)) 

(e) P(xD){ 1/(1 -x)) 

(f) 1/(1 — 3a;) 

(g) 5/(1 -7x)~ 3/(1 -4x) 

2 . 

(a) ( xD)e x = xe x 

(b) (axD + (3)e x = (ax + /3)e x 

(c) ( xD) 2 e x = (x + x 2 )e x 

(d) (a(xD) 2 + /3xD + r y)e x 

(e) P(xD)e x 

(f) e 3x 

(g) 5e 7x -3e 4x 

3. 

(a) f(x) + c/( 1 -x) 

(b) af(x) + c/( 1 — x) 

(c) xDf(x) 

(d) P(xD)f(x ) 

(e) f(x) - a 0 

(f) f(x) — Oo — CL\X + (1 — Cl 2 )x 2 

(g) (f(x) + f(- X ))/2 

(h) (f(x) - a 0 )/x 
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(i) (f(x)-'£,o~ 1 a j x 3 )/x h 

(j) (f ~a o - a x x)/x 2 + 3 ((/ - a 0 )/x) + / 

(k) (/ - a 0 - aix)/x 2 - ((/ - o 0 )/x) - / 

4. 

(a) /(x) + ce* (b) af(x) + ce x (c ) xf(x) 

(d ) P(xD)f(x) (e) / — a 0 (f) / - a 0 - aix + (1 - a 2 )x 2 /2 
(g) (f(x) + f(-x))/2 (h) f’(x) (i )D h f(x) 

a) r + 3f + / (k) r -r-f 

5. 

(a) 2 n /n! 

(b) ct n 

(c) (— l) m if n = 2m + 1 is odd, and 0 else. 

(d) (o n+1 - 6 n+1 )/(o - 6) 

(e) („?J 

6 . 

(a) We see at once that f/x = 3f+2/(l—x), so / = 2x/((l— x)(l— 3x)). 

(b) f /x = af + (3/(1 — x) so / = /3x/((l — x)(l — ax)). 

(c) Here (/ — x)/x 2 = 2//x — f so f = x/(l — x) 2 . 

(d) Since //x = //3 + 1/(1 — x) we have / = 3x/ ((1 — x)(3 — x)). 

8 . 

(a) /' = 3/ + 26®, /(0) = 0 give / = e 3 * — e* 

(b) f' = af + (3e x so f — {(3/(1 — a))(e x — e ax ) 

(c) f" = 2/' - /, /(0) = 0, f(0) = 1 yield / = xe* 

(d) f = //3 + e x , f( 0) = 0 give / = f (e* - e*/ 3 ) 

9. Multiply both sides of the equation f(2n) = f(n) by x 2n and sum over 
n > 1. Then multiply both sides of f(2n + 1) = f(n) + f(n+ 1) by x 2n+1 , 
sum over n > 1, and add to the previous result. Then add f(l)x = x to t-hat- 
result- to obtain the functional equation. 

To find the explicit infinite product form of the solution, let’s first see how 
we might guess that answer, and then how we might prove it. 

Take the functional equation for F. and replace x by x 2 throughout, then 
substitute the result back in the functional equation, to get 

F(x) = (1 + x + x 2 )(l + x 2 + x 4 )F(x 4 ). 

If we now replace x by x 2 again, and substitute we’ll get even more factors of 
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the infinite product. Hence we should suspect that the product is the answer. 
To prove that the product is the answer, we have two choices. First, over 
the ring of formal power series, consider the product as a formal beast which 
obviously satisfies the functional equation for F. Second, analytically, an 
infinite product f([ ( 1 + Qn ) converges if the series ^ \q n \ does; so, the product 
converges for |x| small enough, to an analytic function F. 

10 . For part (a) see section 4.1. 

(b) p ^ = j)Prob(X = n — j) = [x n ]P(x) 2 . 

(c) P k (x) =P(x) k 

(d) By part (c) the mean is 

nm/p*w = [kp(x) k - i p\x)/p( x ) k ] x=1 = 4 «, 

and the variance is 


(log Pk(x))' + (log P k {x))" 


X—l 


A’(logP(x))' + (logP(a;)) // 


5 


X = 1 


which is ka 2 . 

(e) Since A k = B we have kA' /A = B' /B or kA'B = AB' . Equate the 
coefficients of x n to find that nb n = ^” =1 (j (k + 1) — n)ajb n -j for 
n > 1, with bo = 1. 

(f) p* is the coefficient of x 300 in (.lx + .2x 2 + .lx 3 + .2x 4 + .2x 5 + 
,2x 6 ) 100 /(1 — x). Numerically, it is about .00000095. 

(g) The required probability is 


1 rp I /y> 2 I ... I rpFkl 1 ( 1 ^TTl \ TL 

l / x x i I dj \ n J- r j—n] v -1 ^ J 

1 — x^ m m n X (1 — x) n+1 ' 


The result follows by expanding the numerator by the binomial 
theorem, the reciprocal of the denominator by the binomial series, 
and multiplying. 


11 . Among these subsets we distinguish those that do contain n and those 
that don’t. If such a subset contains n, then the rest of that subset is one 
of the subsets that is counted by f(n — 2). If it does not contain n then 
the entire subset is one of those that is counted by /(n — 1). Thus /(n) = 
fin — l) + f(n — 2), which together with the starting values /( 1) = 2, /( 2) = 3 
tells us that f[n) = F n+ 2 , where the P's are the Fibonacci numbers. 

12. As in problem 11, we distinguish those ^-subsets that do contain n and 
those that do not. If such a /e-subset does contain n then the rest of that 
subset is one of the subsets that is counted by /(n — 2, k — 1), otherwise the 
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entire /c-subset is one of those that is counted by f(n — 1, k). Thus f(n, k ) = 
f(n — 1, k ) + f(n — 2 ,k — 1), for k > 2. If we define F k (x) = ^ n>1 f(n, k)x n , 
then after multiplying the recurrence by x n and summing over n > 1 we find 
that Fk(x ) = x 2 Fk~ i(x)/(l — x), which together with F\{x) = x/{l — x) 2 
tells us that Ff~(x) = x 2k ~ 1 /(l — x) k+1 . If we expand in the binomial series 
we find that 


f(n,k) = [x n ]F k (x) 




13 . From problems 11 and 12, it must be that 



(n > 0), 


which will be proved another way in example 1 of section 4.3. 


14 . A circular arrangement that does contain n is obtained by taking a linear 
arrangement of {2, 3, . . . , n — 2}, no two consecutive (on the line), adjoining 
n to it, and laying it out around a necklace. So by exercise 11, there are F n _ i 
such arrangements. One that does not contain n is obtained by taking any 
linear arrangement of { 1 . 2, . . . , n - 1 } and laying it out around a necklace, so 
there are F n+ i of these. Hence there are F n - 1 +F n+ i such circular sequences 
altogether. 


15 . As in the previous problem, the answer is f(n — 3, k — 1 ) + f{n — 1, k ) 
where f(n, k ) = ( n- ^ +1 ) is the solution to exercise 12. 

16 . The partial fraction expansion of 1/(1 — x 2 ) 2 is 


1 


+ 


1 


+ 


1 


+ 


1 


4(1 — x) 2 4(1 — x) 4(1 + x) 2 4(1 + a;) 


Therefore the coefficient of x n in its power series expansion is 


n + 1 1 (— 1) (n + 1) 

1 h — + 

4 4 4 


(-i)" 

4 


which is 0 if n is odd and is (n + 2)/2 if n is even. Otherwise, take the series 
for 1/(1 — u) 2 and replace u by x 2 . Even sneakier would be to use one of the 
symbolic manipulation programs that are now available on computers. They 
can produce the general term of such series on demand. 

IT. Fix j, 1 < j < n, and consider just those permutations a of n letters that 
have a(j) = n. Then no inversions have j as the second member of the pair 
and exactly n — j inversions have j as the first member of the pair. Hence if 
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we delete n from the string of values of a we obtain a permutation of n — 1 
letters with n — j fewer inversions. Thus b(n, k ) = JT b(n — l ,k — n + j). 

Multiply by x k and sum on k to obtain B n (x) = (l + x-| bx n_1 ).B n _i(x). 

Hence b(n, k ) is the coefficient of x k in 

(1 + x)(l + x + x 2 )(l + x + x 2 + x 3 ) • • • (1 + x + x 2 + x 3 + • • • + x n_1 ). 

18 . 

(a) The probability is evidently l/k\ that the first k values will decrease, 
so n\/k\ of the permutations have this property. 

(b) The probability that a permutation begins with k decreasing values 
followed by an increasing one is l/k\ — l/(k + 1)!, if 0 < k < n, and 
is 1/n! when k = n. The average value of k, weighted with these 
probabilities, is 

n—1 1 1 \ n i 

E , / 1 1 \ n ^ 1 

k\ (k + l)\ + n\ kV 

k=0 \ ' J / fc=i 

and therefore an average permutation of n letters begins with a 
decreasing sequence whose length is approximately e — 1. 

(c) If we begin with a permutation of n — 1 letters that has k runs, 
then by inserting the letter n in each of the n possible places we 
manufacture k permutations of n letters that have k runs and n - - k 
permutations of n letters that have k + 1 runs. Thus f(n, k ) = 
kf(n — 1 , k) + (n — k + 1 )/(n — 1 , k — 1). 

19 . 

(a) It is (1 + x) 2 (l + x 2 )(l + x 5 )(l + x 10 ) 2 (l + x 20 )(l + x 50 ). 

(b) The sum represents 3 8 = 6561 integers, each between —99 and 99. 
Hence these 199 integers are represented an average of 6561/199 = 
32.9.. ways, so some integer must be represented at least 33 ways. 
The required product is 

(l/x + l + x) 2 (l/x 2 + l + x 2 )(l/x 5 + l + x 5 )-- - (1/x 50 + 1 + x 50 ). 

(c) If Wi, • ■ ■ , w r are distinct integers, and if D n is the number of repre- 
sentations of n as a sum n = w\X\ + W 2 X 2 + ■ — b WkXk where each 
of the Xi is ±1, then 

r 

J2Dnt n = l[(t Wi +t- Wi ). 

i—1 


n 
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The set of roots is the union of the sets of 2with roots of — 1, for 
i = 1, . . . , k. 


Answers to problems for chapter 2 

1. The thing to remember is that 1/(1 — u) = 1 + u + u 2 + ■ ■ ■. 
(a) We have 

1 1 


cos a; 1 — ( 


x xr_ 

2 24 


+ •••) 


2 4 2 4 

ry^ ry A /y>" /y A 

/ to to , / tO tO » O 

= 1 + ( y“24 + "' ) + ( Y“24 + "' ) + "' 

/-y>2 C\.7>4 

tO t/tO 

= 1 + T + 1T + "' 


(b) Here we use the binomial theorem with negative exponent. 

1 


(1 + x) r 


= (1 + x)~ 


E 


— m 


x 


-m \ / — m \ 2 / — rn \ 3 


= 1+l 1 j I+ ( 2 3 |X ° + 

„ m(m + 1) 9 m(m+l)(m + 2) o 

= 1 — mx H -a; 2 — -x 3 + 

2 6 


(c) This is like part (a). We find 

1 


1 + (t 2 + t 3 + t 5 H ) 


= l-(f 2 +f 3 +f 5 + •••) + ••• 


= 1 - f 2 - f 3 + f 4 + f 5 + • • • . 


2. In each case (except (e)), replace x by x + bx 2 + cx 3 + • • •, set the result 
equal to x, and equate the coefficients of like powers of x to 0 to solve for b 
and c. 

(a) x + ^ -\ 

(b) x - ^ 

(c) x — x 2 + fa; 3 + • • • 

(d) x — x 3 + ■ ■ ■ 
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(e) In this part, note that if y is the inverse function, then 

log(l - y) =x, 

i.e., y — \ — e x = —x — x 2 /2 — x 3 /6 — ■ ■ ■ . 

3. If / = E fc >o a ^ xk then 

0 = f" + / = + 2)(k + l)nfc+2 + cik}x k 

k>0 


and so 


dk + 2 = 


Q>k 


(k = 0,1,2,...). 


(fc + l)(fc + 2) 

If do and ai are arbitrarily hxed, then by induction on k 

ci2k = (— l) fc a 0 / (2fc)! 


and 

« 2 fc+i = (— l) k ai/(2k + 1)! 
for all k > 0, and the result follows. 

4. 

(a) (1 _ x x )2 + nW 

(b) A 

(c) T^ 

(d) {\og(^)-x-^}/x 

(e) {e x — 1 — x — x 2 /2 — x 3 /6 — tc 4 /24} / tc 5 

m T d f 2 i _ 

' 1 da: 1 1 — a;— a: 2 J (1 — a;— a; 2 ) 2 

(g) {(xDxD) + (xD) + l}(e x — 1) = (1 + x) 2 e x — 1 

5. In the binomial theorem (1 + x) n = e* ©a let x = 1. 

6. We have 

f(n,k)= ^2 nin 2 ---n k 

niH fnfe=n 

= [x n ]{J2 rxr ) k 

= ^{(w } 1 

k 


(i — x) 2fc 



204 


Solutions 


Hence f( n -> k ) xU = 7T=0W- Explicitly, since 

V + 2k - 1 


( 1 — a ;) 2 

1 


(1 ~x) 


2k 


E 

r> 0 


r 

X , 


we find that f(n,k ) = ( n ^ fc 1 ). Notice that the answer is 0 when n < k. 
Explain why. 

7 . As in problem 6 above, 


f(n, k, h ) 



fc 


a; 


x 


fc/i 


(1 — x)* 


8. 1, 1, 1, 1, and 1, respectively. 

11 . 1, 1, 5“T 0, 1, respectively. 

13 . Let a and b be relatively prime. Then every divisor d of ab is uniquely 
of the form d = d'd" where d'\a , d"\b. Hence 

g (ab) = f ( d ) = f ( d ' d ") 

d\ab d'\a 

d"\b 

= E /(d')/(rf") = (£/(<?))( E /(<*")) = 9(«)sW- 

d'\a d'\a d"\b 

d"\b 


14 . Consider the n fractions 1/n, 2/n, . . . , n/n. If we write them in lowest 
terms, then each of them will reduce to a fraction h/k, where k\n and h,k 
are relatively prime. Further, for a fixed divisor k of n, each of the 0(h) such 
fractions h/k occurs in exactly one way, i.e. , by reducing exactly one fraction 
m/n. 

15 . For Euler’s function, apply Mobius inversion to the result of problem 14 
above. This gives 

<Kn) = E ^n/d)d = n E = » E 

d\n d\n d\n 

Since // is multiplicative, fi(n)/n is a multiplicative function of n, and so, by 
problem 13, is the last member above. 
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For a(n), suppose a, b are relatively prime. Then every divisor of ab is 
uniquely of the form d'd" , where d! and d" are divisors of a and of b, respec- 
tively, and the result follows. Finally, if a, b are relatively prime, |/i(a6)| = 1 
iff ab is squarefree iff a and b are squarefree. 

16. 

(a) C(«“ !)/C(«) 

(b) C(«K0 - 1) 

(c) C00/C(2a) 

17. 

(a) C(«“ 1) 

(b) ((s-a) 

(c) -C'W 

(d) C(s)C(s-q) 

18. 

(a) C(*){C(* - 1)/C(«)} = C(« - 1) 

(b) COOI1/COO} = 1 

(c) {i/as)Hc 2 (s)} = as) 

19. We hnd that 




1 / 5- y/E 5 + y/b 


+ 


1 — a+x 1 — a-X 
where a± = (3 ± y/b)/2. Hence there are exactly 


5 — y/b U 5 + y/b 


10 


-al + 


10 


-a_ 


block fountains whose first row contains k coins. 


(a) En /( n > T ) xn = (Eter xt ) k 

(b) En 9(n, k, T)x n = [y k ] EUrt 1 + V x *) 

(c) E n f(ri,k,S,T)x" = [y^UteAEses^} 


22. Check that the required number is the coefficient of x" in 
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hence f(n) = 0 for all n except that /( 1) = — 1. 
23 . On the one hand, 

x(e mx - 1) 


e x — 1 


x 


e x — 1 


(e mx - 1) 


E B n n 

—x n 

n\ 


E 


m J x 3 


/y» ' L I 

E^riE 


n> 0 


i>i 


i>i 
n 


B n -jm° 


On the other hand, 

x(e mx - 1) 

e x - 1 


= x 


- 1 


e x — 1 

= z(l + e* + e 2 * H he 

m— 1 

-EE 




) 


‘T r 

j x 


j = 0 r> 0 
r+1 


J* I 




->0 


where S r (m) is the sum of the rth powers of the integers 1 , ,m. If we 
compare the coefficients of x n we find the explicit- formula 


s " (m) = E 


•>1 


n+ 1 
r 


B n +i- r {m + 1 ) , 


which holds for integers m,n > 1. The first few cases are, for n = 1.2.3, 


m m 

1 + 2 + ... + m= — + — 

m 3 mn 2 m 


i 2+22 + ... + m2 = _ + _ + _ 

4 3 2 

o o m m m 

I s + 2 3 H h m 3 = — — I h — . 


25 . 

(a) If they differ in the jth bit, then their colors differ by j which is not- 
0. If they differ in the jt-h and k\ h bits, then their colors differ by 
j + k or j — k modulo 2 n, neither of which can be 0. 

(c) f(z) = Uk=A 1 + zk )- 
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27 . 

(a) e ~ x /(I — x). 

(b) If D(x) = e~ x /(I — x ) then (1 — x)D' = D — e~ x . Match [x n ] on 
both sides. 

(c) If Di(n) is the number with one fixed point then D\(n) = nD(n— 1). 
Thus D(n) — Di(n) = D(n ) — nD(n — 1) = (— l) n by part (b). 

(d) To construct a permutation of n letters that has k hxed points, 

we can choose the k hxed points in ( ? /) ways, and the rest of the 
permutation in D(n — k ) ways. Hence D^(n) = — k ). Now 

multiply by x n y k /n\, sum, and use part (a). 

28 . We have 


E /*("/<*) a d (x n / d ) = J>(> 

d\n d\n 5\d 

in which the coefficient of b r (x n ^ r ) is E< 5 \n/r M n / ( r ^)): which vanishes unless 
r — n and is 1 in that case. 


30 . 


(a) En> 1 vV™ S = En>l l / nS 1/2 = C(s - 1/2). 


(b) The function is multiplicat ive anc 


and 1 otherwise. Hence by (2.6.6) 


its value at n = p a is 0 if a > 2 
the generating function is 




P 


-2s 


p~ 


C(«) 

C(2a)' 


(c) Here A (p a ) = (—1)“ for all a > 0, so by (2.6.6) its generating func- 
tion A(s) is 


p p 


C(2s) 
CM ' 


Finally, equate coefficients of n s on both sides of A(s)£(s) = C(2s). 


31 . 

(a) If 


n> 1 


E 

n> 1 


X 


1 — x r 


E^E^^E^E^ 

n m>l r d\r 


thus b n = Ed\n a d • just as in the theory of Dirichlet series. 
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(b) Apply part (a) with a n = p(n). 

(c) It is 


X 


(i — xy 


32. 


n> 1 n> 1 

since E d \ n 0(<O = n - 

(a) //( 1-x) 

( b ) //(l-^) r 

(c) By part (b), it is the sequence whose gf is 1/(1 — x) r+1 , which, by 
(2-5-7) I is the sequence ( n + r ) n>0 - 


(d) It is the coefficient of x n in (^o n x n )/( 1 — x ) r , viz. 


E 

m = 0 


TO + r — 1 


TO 


ttn—m X — 0 ; 1 ; 2 , . . .). 


(e) Then f(x)/( 1 — x) r = 1, so /(x) = (1 — x) r and a n = ((()(— l) n for 
n > 0. 


33. 

(b) We have 


$ p a(x) = Y[( l- x d Y {pa / d) 

d\p a 

= -^- = l + x p + x 2p H bx (p_1)p . 

1 - xP 

(c) Since n m \ n ®m(x) = 1 - x n we have 

n* m (D="- (*) 

m\n 
ra> 1 

Now for n = p k use induction on k. If n is not a prime power let p a 
be the highest power of p that divides n. Then in (*) above, each 
divisor pi (1 <3 <a) contributes a factor of p, so all such divisors 
contribute p a . But no higher power of p divides n, so $ n (l) cannot 
be divisible by p. Since p was arbitrary, T,, ( 1 ) must be ±1, and it 
is easy to rule out —1. 
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34. 

(a) This says that each integer r, 1 < r < n is uniquely of the form 
r = rnd where d\n and gcd(m, n/d) = 1. But this is clear since we 
take d =gcd(r, n) and m = r/d. 

(c) Let x — *• u in the result of part (b), and use L’Hospitaks rule. 


Answers to problems for chapter 3 


1. A partition of n into odd parts looks like 

n = r\ ■ 1 + rs ■ 3 + r$ ■ 5 + • • • . 

Now substitute the binary expansion of each r*, to get 

n = (2 ai + 2 bl +•••)• 1 + (2“ 2 3 * + 2 6s + •••)• 3 + (2“ 5 + 2 b& H ) • 5 + • • • 

But now we have a partition of n into distinct parts, viz., 

n = 2 ai + 2 bl + • • • + 2“ 3 • 3 + 2 63 • 3 + • • • + 2“ 5 • 5 + 2 &s • 5 + • • • . 


(What partition corresponds to 39=3+3+7+7+19 ?) The map is uniquely 
invertible. 


2. The deck has a card corresponding to each cyclic permutation of length 

k,2k,3k, . . .. The number of these on mk letters is (rnk — 1)!. The deck 

enumerator is 


v ( x ) = 

m> 1 


(mk- !)! k 
(mk)! 


r^mk 

v — > ,L 

' mk 

m> 1 



1 

1 — X k 


Hence the hand enumerator, without regard to number of cards in the hand, 
is 

H(x) = exp | i log } 

1 

_ (i -x k y/ k 

= E 

m> 0 ' ' 

The required number is the coefficient of x n /n\ here, which is 0 if k does not- 
divide n, and is 


(-l) r 


i\ fll 

k )n! = ~ri—{k + l)(2fc + 1) • • • ((r — 1 )k + 1) 
) r\k r 


r 
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if n/k = r is an integer. 


3. It is exp <(E P fr }■ 


4. 


5. 


(a) The order is the least common multiple of the cycle lengths. 

(b) Clearly, 

g(n,k) = ^2g(n, d ). 
d\k 

Hence by Mobius inversion (2.6.12) we have 


g(n,k) = ^2n(k/d)g(n,d). 

d\k 


(a) One finds by logarithmic differentiation of the egf (3.8.3) 


that 


T n = T n _ i + (n - l)T n _ 2 (n > 2;T 0 = l;Ti = 1). 


(b) 1, 2, 4, 10, 26, 76 

(c) Consider separately those involutions of n letters for which n is a 
fixed point and those for which n is not fixed. 


6. The deck enumerator is 


V[x) = £^=log 


i>4 


and so the hand enumerator is 


(1 ~x) 


x — x 2 /2 — x 3 /3, 


H(x) = 


,—x — x 2 /2—x 3 /3 

(1-x) 


7. A card of weight n is a path or a cycle. If n > 3, there are ( n — l)!/2 
‘cycle cards’ of weight n, and if n > 2 there are n\/2 ‘path cards.’ Hence 


V(x) = 


1 


(1 ~x) 

and the hand enumerator 7i(x) is 


log (1 — x) — 1 — 2x — x 2 /2 ] /2, 


sinh 


1 1 ^ \ 1 

2(1 — x) 2 ’ 2 4 


,6 


,7 


/y»2 /y» 3 ryt 4 rf* 1 /y* 1 - 

= ^ + 4 ¥ + 15 ¥ + 72 ¥ + 435 ¥ + 3300 7f + 3031 V + ' ' ' ' 
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Hence the numbers of such graphs on 0, 1, . . . , 8 vertices are 0, 0, 1, 4, 15, 72, 
435, 3300, 30310. 

8. One finds [£] = \^~l] + (n — l)^ 1 ]. This can be proved directly by 
considering separately those permutations of n letters and k cycles in which 
n is a fixed point (cycle of length 1) and those in which it is not. 

9 . Here the deck enumerator is 


v ( x ) = 

m> 1 


(2m — l)\x 2m 
(2m)! 


log 


1 

y/l — x 2 


The exponential formula states that the question is answered by sinh 'V(x), 
which simplifies to 


H(x) 


2Vl-x 2 ' 


The coefficient of x n jn\ is g(n) — 0 if n is odd, and 


g(n) 


n\ fn — 2\ 

s^Vf-V 


if n > 2 is even. 

10 . We find that 



^ ryn — 1 

(k — r)!(r — 1)! ' 


12 . If F(n,k ) is the number of n-permutations whose cycles have lengths 
< k, then F has the egf exp (x + • • • + x k /k). But f(n, k ) counts those whose 
longest cycle has length k, so if k > 1, f(n, k ) = F(n, k ) — F(n, k — 1), and 
the required egf is 


^x-\ H 



13 . 

(a) It is 


E tidjQk 

i\j\k\ 

i+j-\-k=n 


(■ n > 0). 


(b) If we multiply through by n\ to get 


E 

i+j+k=n 


n\ 

iljlkl 


tidjdk — n! 


(n > 0). 



212 


Solutions 


then the multinomial coefficient under the summation sign counts 
the ways of choosing an ordered triple (R, S, T ) of subsets that par- 
tition [n], ti counts the involutions of i letters, each of which gets 
relabeled with the elements of R, (jj counts the 2-regular graphs of 
j vertices, each of which gets relabeled with the elements of S. etc. 
Finally, the right side n ! counts n-permutations. 

(c) This elegant solution was found by Mr. Douglas Kat-zman. Given 
the triple (r, G\ . GC), we construct the corresponding permutation a 
as follows the cycles of the involution r, acting on /?. become cycles 
of a. For each cycle in the graph G i, locate the smallest numbered 
vertex v in the cycle. Choose that one of the two possible ways of 
orienting the cycle which carries v to the larger numbered vertex 
of its two neighbors. Conversely, in G 2 select the orientation t-hat- 
carries the smallest numbered vertex of each cycle into the smaller 
of its two neighbors. 

14. 

(a) From the defining equation 


e y^(x) 


4>n(y) x n 
n! 


we see that each application of the operator D y multiplies the left- 
side by another T>(x), so the application of some function f{D y ) will 
multiply it by f(V(x)). If we choose / to be the inverse function 
p( _1 )(_D y ), then we will multiply the left side by 'D^~ 1 \V{x)), that 
is, by x. If we multiply the right side of the defining equation by x. 
we see that it becomes the egf of {n(p n -i(y)}, as claimed. 

(b) In this family, 

n 1 . = y\ (~ y \-i) n x n 

(l-x)y Z ^\nJ K 

sr^y(y + 1 )---(y+n-i)^ n 

/ • th . 


£ yV(x) _ 


Thus <j)n(y) is the ‘rising factorial’ y(y + 1) • • • (y + n — 1). To 
check the identity, we have first that the deck enumerator is 'D(.x) = 
— log (1 — x). Hence = 1 — e~ x . Therefore 

^ (_1) {D y )<l>n(y) = {1 - e~ Dy }(t)n{y). 


But Taylor’s theorem from differential calculus is identical with the 
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assertion that ( e D )f(y ) = f(y + 1) (Hcheck this!!). Hence 

(1 - e~ Dy )(p n (y) = <j> n (y) - cf) n (y - 1) 

= {y ■ ■ ■ (y + n - 1)} - {(y - 1) " • (y + n - 2)} 

= ny{y + 1) ■ ■ ■ (y + n - 2) 

= n4 n -i(y), 


as required. 

15 . The result claimed is certainly true if there is only one card in the deck. 
Then, by the merge, trickle, and flood argument, it is true in general. Part 
(b) is immediate. For part (c), insert the factors into the product, and outside 
the product write the reciprocals of all of those factors, to get 


k = l 

OO 


1 i r / X x x k 

nu + ir) 6 - 


(1-x) 


k= 1 


Now as x — » 1 , the infinite product approaches a certain universal constant, 
viz. 

OO 1 

c =n( i+ d e_i ' 

k= 1 

hence p(x) ~ (7/(1 — x). The constant is in fact e -7 , where 7 is Euler’s 
constant. 

16 . Here c n is the coefficient of x n jn\ in 


(1 + x) X+1 = (1 + x) x {l + x) = (1 + x) 


X 


X 


(! 


x(x — 1 ) • • • (x — k + 1 ) k 


k 


id 


k r 


k 


X 


Thus 


Cn 

n 


= (n — 1 )!< 


r^(-ir- fc 

1 


1 


k\ ( 

k k 

1 

3 

1 

?r 


1 

3 

1 

?r 

I 

h 


But in the sum the terms all vanish for k > n, hence the right side is an 
integer. 
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18 . The number o f cards in the jth deck is 1 for j = 1,2 and is 2 for j > 3. 
Hence by (3.14.6), 


1 


the hand enumerator is 

1 TT 1 


n 


P(x)‘ 


1 — x (1 — x 2 ) -*: 1 (1 — xi) 2 


(1 — x)(l — x 2 ) 


where P(x) is Euler’s generating function (3.16.3) for {p(n)}. 


19 . In such a tree there is a rooted tree of j vertices attached to one of the 
edges incident at the root, and a rooted tree of n — 1 — j vertices attached to 
the other edge at the root. Further, the full tree is completely determined by 
this unordered pair of trees, and so the number a n of such full trees is equal 
to the number of unordered pairs of rooted trees, the total number of whose 
vertices is n — 1 , i.e. , 

1 

a n ~ 2 / j 

j 

if n — 1 is odd, for then every unordered pair is counted twice by the sum. 
If n — 1 is even then we need to consider the number of ways that the two 
subtrees at the root can be of the same size (n — l)/2. The number of 
unordered pairs of not necessarily distinct objects that can be chosen from a 
set of a different objects if ( a ^ 1 ) • Thus in this case the formula above needs 
an extra term t( n - 1 )/ 2 / 2 added to it, which is equivalent to the result stated. 

20 . It is 29. 29 cannot be of the form stated, for otherwise we could subtract 
some multiple of 15 from it to find a nonnegative number of the form 6.x + lOy. 
But 29 is not of that form since it is odd, and 14 isn’t either. Next, if n is 
any integer that is representable then so is n + 6, so to see that every integer 
larger than 29 is so representable it is enough to observe that 30 = 6 • 5, 
31 = 6 + 10 + 15, 32 = 6 • 2 + 10 ■ 2, 33 = 6 • 3 + 15 ■ 1, 34 = 6 • 4 + 10 ■ 1, and 
35= 10-2 + 15- 1. 


21 . If f(n) is that number then 




1 


1 1 

+ 


i>0 


(1 — x)(l — x 2 )(l — x 3 ) 6(1— x) 3 4(1— x) 

17 1 1 1 

+ r + r + r + 


72(1 -x) 8(1 +x) 9(1 -ux) 9(1 — oox) ' 

If we expand each of the fractions on the right we find the formula 

,, , lfn + 2\ 1 . . 17 (-l) n 2 . 2n7r 

’ 6\ 2 j 4 l ; 72 8 9 V 3 ; 

which can be rewritten as 

( n + 3) 2 1 -7 + 9(— l) n + 16cos(2mr/3) 
jyn) — — I . 


12 


72 
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The second fraction cannot exceed 32/72 < 1/2 in absolute value, so f(n) 
is the unique integer whose distance from (n + 3) 2 / 12 is less than 1/2, as 
required. 

22. For a given aq, a 2 , ■ ■ •, we put n = aq + 2oq + • • •, and we can then 
construct all possible hands of the desired type by choosing and labeling 
cards from the given decks as follows. 

Make an ordered selection of aq cards of size 1 chosen independently from 
the d\ cards of size 1 that are available in deck 1. Then make an ordered 
selection of a 2 cards of size 2 from the d-x cards of that size that are available 
in deck 2, etc. The number of ways in which this can be done is d^d^ 2 

Next, for the aq chosen cards of size 1, choose the 1 label that will appear 
on each card, which can be done in n\/(n — aq)! ways, but since the order 
of these cards in the hand is immaterial, this labeling can be done in only 
n!/(oq!(n — aq)!) ways. 

Then, for the a 2 chosen cards of size 2, choose the unordered pairs of labels 
that will appear on each card. This can be done in 

(n — oq^ fn — aq — 2^ fn — aq — 2 a 2 + 2^ (n — aq)! 

V 2 JV 2 )" V 2 ) ~ (n — aq — 2a 2 )!2!“ 2 

ways (we need only the unordered pairs because the chosen cards have place- 
holders on them that tell us in what sequence to place the chosen label set 
on the card). Finally, since the order of the cards of size 2 in the hand is 
immaterial, there are only 


(n - oq)! 

(n — oq — 2a 2 )!2!“ 2 a 2 ! 


different ways to do this. 

In general, for the a :i chosen cards of size j, we can choose the sequence of 
sets of j labels that will appear on each card in exactly 

(n — oq — 2a 2 ( j ~ l)o.j-i)! 

(n — aq — 2 a 2 — • • • — a,j\ 

different ways. If we multiply all of these together, for all j > 1, we find t-hat- 
the number of hands of the desired specification is 

nld^d ?, 2 ■ ■ ■ 

l! ai 2!“ 2 • • -ai!a 2 ! •• •' 

But this is exactly the coefficient of • • • /n\ in the expansion shown 

in the statement of the problem. 



216 


Solutions 


For part (b), in the family of set partitions we have all dj = 1 for j > 1. 
Use the result of part (a), with X 3 — X 4 = • • • = 1, since we don’t care about 
classes of size greater than 2, to obtain the joint distribution of classes of 
sizes 1 and 2 in the form stated. 


Answers to problems for chapter 4 


1. We have p n = (1 — p) n l p for n > 1, hence {p n } has the opsgf P(x) = 
px/(l — (1 — p)x ). The mean is P'( 1) = 1/p, and from (4.1.3) the variance is 


a 2 = (logPy + (logP )"\ x=1 = 


(1 ~P) 


pz 


2 . 


(a) 


(b) 


Consider a sequence of n trials that yields a complete collection 
for the first time at the nth trial. From that sequence we will 
construct an ordered partition of the set [n — 1] into d — 1 classes, 
as follows: if the zth photo was chosen at the jth trial (1 < i < d, 
1 <??. — 1), then put j into the ith class of the partition. Note 

that d — 1 of the classes are nonempty. Conversely, from such an 
ordered partition of [n — 1] we can construct exactly d collecting 
sequences, one for each choice of the coupon that wasn’t collected 
in the first n — 1 trials. There are ( d — 1 ) !{/ ^ J j ordered partitions 

of [n — 1] into d~ 1 classes, so there are sequences of trials 

that obtain a complete collection precisely at the nth trial. There 
are d n unrestricted sequences of n trials, so the probability of the 
event described is as shown. 


(1.6.5) 


p(x) = (d — l)!x 

n 



( d — l)\x d 

(d — x) ■ ■ ■ (d — (d — l)x) 


p'(l)=d(l + i + --- + i) 


(d) From (4.1.3) 


a 2 = 




2 — 1 


About 29 boxes of cereal, with a standard deviation of about 11 
boxes. 
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3. For part (a), the probability p(j,vi,T) has two components. First, with 
probability d\/(d\ + 1), the walk begins with a step to another vertex of 7\. 
In that case the probability of a first return after j steps is the same as it 
was in Ti, which gives a contribution of 


di 

di + 1 


■p(j]v 1 ]T 1 ) 


to the answer. 

On the other hand, with probability 1/(1 + di) the walk begins by using the 
edge (vi,v 2 ). In that case the required probability will be the probability 
that the walk takes exactly j — 2 steps in the tree T 2 , finishing at v 2 and then 
crossing back over the edge (vi,v 2 ) to vertex v\. 

Fix m > 0, and consider the following event: the sequence of vertices that 
the walk visits after crossing to V 2 contains exactly m + 1 appearances of 
vertex v 2 followed by the return to v \ . Hence the sequence looks like 


v 2 , IFi , u 2 , 1F 2 , . . . , W rn , v 2 , 


where each of the W % is a sequence of vertices of T 2 — v 2 . The total number 

of steps in such a walk is ]\ H 1- j m , where the j t are the numbers of steps 

between consecutive returns to v 2 . We need the probability that j 1 + j 2 + 
• ' ' + jm = j — 2. But that is 


P(h', v 2] T 2)p(j2] V 2 ; T 2 ) • • - pUrn, V 2 \ T 2 ) ^ ^ ^ ^ 1 ) 


jl~\ K jm — j 2 


d. 


d 2 + 1 V d 2 + 1 


d 2 + 1 J d 2 + 1 
[xi~ 2 }F 2 (x ] v 2] T 2 ) rn . 


If we put it all together, we find that p(j] T) is 


d\ 

di + 1 


__ ( d 2 \ m 

+ i kJ 2 + 1) 




m> 0 


hi 


di + 1 


■pO>i;Ti) + 


1 


di + 1 


X' 


J- 2 1 


^- 2 ] J F 2 (o:;u 2 ) m 

1 

d 2 + 1 — d 2 F 2 (x;v 2 ) 


Finally, if we multiply by x J and sum over j . we obtain the result stated. 

In part (d) one has P n (x) = x 2 /(2 — P n _ i(x)) for n > 2, with Pi = 1. If 
one assumes P n (x) = A n (x)/B n (x), then A n = x 2 B n - 1 and P n = 2B n -i — 
x 2 B n _ 2 . This leads to the result that 


P n (x ) = 


r!p 2 + r!“ 2 

r n-r +r n-i 


(n > 2) 
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where r± = 1 ± \f\ — x 2 . 

4. The sequence {e< m } is obviously generated by 

E{x) N(x — 1) 

1 — x 1 — x 

Since e> m = 1V(0) — e< m _i, it has the gf 

N( 0) _ E(x) _ NjO) - xN(x - 1) 

1 — X 1 — X 1 — X 


5. The board consists of only the diagonal cells of a full nxn board. To put 
k nonattacking rooks on this board we can choose any k of the n cells on the 


board, so rfc = (£). Then (4.2. 17) with j = 0 gives 


(-T^E 


fc =0 


(-iy 

k\ 


for the answer, in agreement with (4.2.10) 
6. We have 


E 

m> 0 


— 


x m ^(-l) r - m iV r 

m> 0 r>m 

£(-i r N r E (- 1 ) 

r>0 0 <m<r 

1 + (-l) r X r+1 


m x rn 


£(-iwf 

' r ' b > ri ^ 


r ->0 

1 


1 + X 


1 + X 


|e 0 + yy iv r ^ r+l | 

^ r>0 ' 


eo + xN(x) eo + xE(l + x ) 

1 + x 1 + x 

e 0 + x{e 0 + ei(l + x) H } 

1 + x 


Problem 7 is similar. 
8. The sum is 


1 + x + 


n 


(x 2 +x 6 ) + [ 2 )(x 4 + x & ) + --- 


n 

1 

2\n 


= (! + *)(!+(; )x^+( 2 )x 4 + ---) 


n 


= (1 + x)(l + x ) 
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If fm(y ) denotes the sum in question, then Snake Oil finds that 


fm(y)x m = (1 + x)(l + xy + x 2 ) n . 


See what happens if you try to extend this to the sum that results from 
replacing ‘|_r/2j’ by ‘|_r/3j’ in the sum to be found. 

9. The ‘objects’ Q are the A n possible ways of assigning colors to the vertices 
of G. For each edge e of the graph G there is a property P(e); a coloring has 
property P(e ) if the two endpoints of edge e have the same color. We seek 
the number of objects that have exactly 0 properties. 

Now consider IV(I) S ). For a given set S of edges, this is the number of 
colorings such that at least all of the edges in S are badly colored, i.e., have 
both endpoints the same color. Think of the graph Gs whose vertices are all 
n of the vertices of the graph G, together with just the edges in S. If all of 
the edges in S are badly colored, and if C is one of the connected components 
of Gs, then every vertex in C must have the same color. So the number of 
ways of assigning colors to the vertices of G such that the edges of S are 
badly colored is N(D S ) = A K ( s \ where k(S) is the number of connected 
components of the graph Gs- Hence 

P(\-x-G) = ^N r (x-l) r , 

r 


where 

N r = J2 XK{S) - 

\S\=r 


10. There are k n possible words, and we take these to be our set of objects 
Q. A word has property i if the substring w occurs in the word, beginning 
in position i of the word. Let S' be a given subset of properties, i.e., a set 
of places where the substring w is to begin. We seek IV(I) S), which in this 
case is the number of words of n letters, chosen from an alphabet of k letters, 
that have the substring w beginning in all of the positions indicated by S, 
and maybe elsewhere too. But there are no such words if two of the elements 
of S differ by < m, for then two occurrences of w would overlap, contrary to 
the hypothesis that they cannot do so. 

Hence we suppose that no two elements of S differ by < m. Then rm of the 
characters in the word are specified to be occurrences of w, where r = |<Sj. 
That leaves n — rm characters to be specified, and that can be done in 
N(P S) = k n ~ rm ways. Hence N r is k n ~ rrn times the number of subsets S 
of r elements of [n — m + 1] that have no two entries that differ by < m. But- 
how many such subsets are there? 
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Consider a subset S of r elements of [g], no two of whose entries differ by < m. 
If we delete the elements of S from [g], the remaining q — r integers are broken 
into r + 1 intervals of consecutive integers whose lengths are t 0 - 1\. ... .t r . say, 
where each ti > m — 1 for 1 < i < r — 1. The number of ways to choose such 
integers to, ... ,t r is clearly 


\x 


q-r 1 


1 -i' I I 1 — x 


x 


— l 'i r ~ i 


1 — X 


\x 


q-r] 


X 


(m— l)(r— 1) 


\X 


(1 - x) r+1 

q—r — (m—l)(i — 1 ) 


(i 

q — (m — 1 )(r — 1) 
r 



Thus, since q = n — m + 1, N r = k n rm ( n , and the number of w - free 

words is 

The Snake Oil method tells us that the answer is also the coefficient of x n in 

1 

1 — kx + x m 

In turn this suggests that it might have been easier to do this problem by 
finding a recurrence relation that is satisfied by the answer, instead of by 
using the sieve method, but we wanted to show you another example of the 
sieve method in which the A r (I) S )’ s do not depend only on the cardinality 
of the set S. 


13. This is an example where the Snake Oil method doesn’t work immedi- 
ately because the free parameter n appears too often in the summand. As in 


example 9. the thing to do is to generalize the problem, in this case to the 

B- 1 )' 


sum 


n 

n — m + k 


The latter responds nicely to Snake Oil, after multiplying by x m etc. Then 
set m = n. 


17. We find in part (a) that 


d 


fA 


dF 


— / F(x,y)dy= / ~K~dy 


dx 


' -B 


1 —B 


dx 

dG 


dy 


J-B dy 
= G(x,A)-G(x,-B) 


(. A , B — ¥ oo). 
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18 . 

(a) Since the sum of the d ' s is 2 n — 2, their average is 2 — 2 /n which is 
less than 2, so at least one of the cf s must be 1. We can suppose 
w.l.o.g. that di = 1. Then, in every tree whose degree sequence is 
A = (di, . . . , d n ), vertex 1 is connected to exactly one other vertex. 
There is an obvious 1-1 correspondence between the trees of degree 
sequence A in which vertex 1 is adjacent to vertex j, for some fixed 
j > 2, and the trees of n — 1 vertices 2,3, ... ,n, in which the vertex 
degrees are ( 0 ^ 2 , • • • , dj-i, dj — 1, dj+i , . . . , d n ). By induction on n, 
then, the number whose degree sequence is A is 




(n — 3)! 


^ (d 2 - 1)! ■ ■ ■ (d^ - 1 )\(dj - 2)! ■ ■ • (dn - 1)! 


E 


(n-3)!(dj - 1) 


~2 (d 2 — 1)\ ■ ■ ■ (dj — 1)\ ■ ■ ■ (d n — 1)! 


= ((2n-3)-(n-l)) 
(n — 2)! 

" (di — !)!•■• (d n — 1)! 


(n — 3)! 


(d 2 -!)!••• (d n -l)! 


as required. 


(b) By the multinomial theorem (see exercise 20 of chapter 2) 


F n (x !,..., x n ) = (xix 2 ■ ■ ■ x n )(xi -\ h x n ) 


n — 2 


(d) Let a tree T have property i if vertex i is an endpoint. If S C [n] then 
the number of trees of n vertices whose set of properties contains S 
is 

N(D S) = (n — |^|) n “ |s| “ 2 (n - \S \ ) |s| = (n - |^|) n “ 2 , 

since the first factor is the number of trees of n— |<Sj vertices and the 
second factor is the number of ways we can attach the .S' endpoints 
to such a tree. The result now follows from the sieve. 

(e) In the sieve method, the average number of properties that an object 
has is always Ni/N, which in this case is 


(n — l) n 2 n 
n n ~ 2 


n( 1 


^n-2 

n 


n 

e 


(a) Evidently we have for all T 

me s) = Y, ^(= n 

vcs 


19 . 


222 


Solutions 


by definition. Now substitute this for N ( C S)) under the summa- 
tion sign in the expression given on the right side of the statement 
of the problem, interchange the order of summation and verify the 
resulting identity. 

(b) It is 


V hn ^ x n 
n\ 

n> 0 


n«p 

ses 



Answers to problems for chapter 5 

1. Let t = 1/A, (j)(u) = 1 + u L , and f(u) = u. Then the equation u = t(f>{u) 
that is treated by the LIF becomes the present equation. The result follows 
after a small calculation involving the binomial theorem. 

3. In the LIF, choose the function f(u ) that satisfies f'(u ) = l/0(u). Then 

-[“"“‘ij/'wtu)"} = h«" _1 w«)" _1 - 

77/ I I 

On the other hand, if we write z(t) = f(u(t)), then 

[(”]/(«(()) = {t n ]z(t) = hr - 1 )z\t) = fir-'i/dh 

It It LLyLJ 

For the last equality of the problem, differentiate u = t<p(u) with respect to 
t. 

4. 

(a) Put 4>(u) = 1 + u + u 2 in the result of the previous problem to find 

that 7 n = [t n ] ; where u = u(t) satisfies u = t(l + u + u 2 ). 

By solving the quadratic equation for u and substituting, we find 
the result stated. 

(b) Let x = i/y/ 3 in problem 2. 

5. 

(a) Clearly S p (n) = [x n ] | (1 + x) pn /(l — x) \. 


(b) Take (j){u) — (1 + u) p and f(u) = 1/((1 — u)(l + u ) p ) in the LIF, 
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and find 

S» = 1 |.,n| (1 + W P( " +1) 

n + 1 n + 1 J (1 - u)(l + u)p 

= [< n+1 ]/(«(t)) 

= -XT [* n ]{/'(“(t))“'( f )} 

/ i I JL 

= 1 \+n] U ' (*) 

n + l [ J (l-u)(H-u) p 

1 un } tu'(t) 

n + 1 (1 — u(t))u(t) 

Since u = t(l + u) p , we find v! = and substitution leads 

to the result stated. 

(c) Equate coefficients of x n on both sides of the result of part (b). 


6. Let x ni , . . . , x n k be the powers of x who se coefficients in P(x) are strictly 


positive. Then, by Schur’s theorem 3.15.2, 


what is needed is that 


gcd(rn , • • • ,n k ) = 1. 


(a) It is 

(1 + n a ) n ~ n na exp ( n l ~ a - n l ~ 2a / 2 + ■••), 

where the argument of the exponential terminates after the last 
positive exponent of n is reached. 

(b) As above without the factor n na . 

(c) It is ~ 1. 


8 . 


Schur’s theorem 3.15.2 


(a) It is admissible because, by 

has positive coefficients from some point on. 


3 2:+z 2 /2+2: 3 /3 


9. Take f(u ) = (1 + u) k and (j){u ) = (1 + u ) 2 in the LIF. 
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